AROUND THE CIRCULAR LAW 
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Abstract. These expository notes are centered around the circular law theorem, which 
states that the empirical spectral distribution of a n x n random matrix with i.i.d. entries 
of variance 1/n tends to the uniform law on the unit disc of the complex plane as the 
dimension n tends to infinity. This phenomenon is the non-Hermitian counterpart of the 
semi circular limit for Wigner random Hermitian matrices, and the quarter circular limit 
for Marchenko-Pastur random covariance matrices. We present a proof in a Gaussian 
case, due to Silverstein, based on a formula by Ginibre, and a proof of the universal 
case by revisiting the approach of Tao and Vu, based on the Hermitization of Girko, the 
logarithmic potential, and the control of the small singular values. Beyond the finite 
variance model, we also consider the case where the entries have heavy tails, by using 
the objective method of Aldous and Steele borrowed from randomized combinatorial 
optimization. The limiting law is then no longer the circular law and is related to the 
Poisson weighted infinite tree. We provide a weak control of the smallest singular value 
under weak assumptions, using asymptotic geometric analysis tools. We also develop a 
quaternionic Cauchy-Stieltjes transform borrowed from the Physics literature. 
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These expository notes are split in seven sections and an appendix. Section [T] introduces 
the notion of eigenvalues and singular values and discusses their relationships. Section [2] 
states the circular law theorem. Section [3] is devoted to the Gaussian model known as 
the Complex Ginibre Ensemble, for which the law of the spectrum is known and leads to 
the circular law. Section [J] provides the proof of the circular law theorem in the universal 
case, using the approach of Tao and Vu based on the Hermitization of Girko and the 
logarithmic potential. Section presents some models related to the circular law and 
discusses an algebraic-analytic interpretation via free probability. Section [H] is devoted to 
the heavy tailed counterpart of the circular law theorem, using the objective method of 
Aldous and Steele and the Poisson Weighted Infinite Tree. Finally, section [7] lists some 
open problems. The notes end with appendix [XI devoted to a novel general weak control of 
the smallest singular value of random matrices with i.i.d. entries, with weak assumptions, 
well suited for the proof of the circular law theorem and its heavy tailed analogue. 

All random variables are defined on a unique common probability space {i},A,¥). A 
typical element of J7 is denoted cj. Table [1] gathers most frequently used notations. 



The eigenvalues of yl G A4n(C) are the roots in C of its characteristic polynomial 
Pa{z) := det{A- zI). We label them Ai(A), . . . , A„(^) so that \Xi{A)\ >■■■> \Xn{A)\ 
with growing phases. The spectral radius is |Ai(A)|. The eigenvalues form the algebraic 
spectrum of A. The singular values of A are defined by 



The matrices A, A ,A* have the same singular values. The 2n x 2n Hermitian matrix 



has eigenvalues si{A), —si{A), . . . , Sn{A), —Sn{A). This turns out to be useful because the 
mapping A i— )■ Ha is linear in A, in contrast with the mapping A i— )• ^TAA" . Geometrically, 
the matrix A maps the unit sphere to an ellipsoid, the half-lengths of its principal axes 
being exactly the singular values of A. The operator norm or spectral norm of A is 



The rank of A is equal to the number of non-zero singular values. If A is non-singular 
then Si{A~'^) = s„_i(^)"^ for ah 1 < i < n and Sn{A) = si{A~'^)~^ = \\A^^\\2\2- 



1. Two KINDS OF SPECTRA 



SkiA) := Xk{VA¥) 
for all 1 < A; < n, where A* = A~^ is the conjugate-transpose. We have 

Sl{A) > ■ ■ ■> Sn{A) > 0. 




^||2-!>2 := max ||^x||2 = •Si(^) while Sn{A) 

ll^l|2 = l 



min \\Ax 
3;'ll2=i 




Figure 1. Largest and smallest singular values of ^ G A^2(IR)- Taken from |33) . 



Since the singular values are the eigenvalues of a Hermitian matrix, we have variational 
formulas for all of them, often called the Courant-Fischer variational formulas \82i Th. 
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log 
a := ■ ■ ■ 

lim and lim 
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A 

A-^ and A* 
Tr(^) and det(A) 
I (resp. In) 

A-z 
Sk{A) 
\k{A) 
Pa{z) 

/"A 

span(- • • ) 

(•,•) 

dist(t), V) 

supp 

\z\ and card(S) 

M, 

11^112^2 
11^112 

o(-) and O(-) 
D 

u,v,w 

H 
X 
G 
_1e 

d, d,A 

P{C) 
0'(C) 
C, c, co,ci 
X 

n 

M{m, E) 
and 



set of 2 X 2 matrices of the form 



with z G C and r; G 



natural Neperian logarithm function (we never use the notation In) 
the mathematical object a is defined by the formula • • • 

inferior and superior limit 
set of non- negative integers 1,2,... 
set of non-negative real numbers [0, oo) 

set of complex numbers with positive imaginary part 

'n 

complex number (0, 1) or some natural integer (context dependent) 
set of n X n matrices with entries in K 

set of n X m matrices with entries in K 

transpose matrix of matrix A (we never use the notation A' or M) 
conjugate matrix of matrix A or closure of set A 
inverse and conjugate-transpose of A 
trace and determinant of A 

identity matrix (resp. of dimension n) 

the matrix A ~ zl (here 2 G C) 

fc-th singular value of A (descending order) 

fc-th eigenvalue of A (decreasing module and growing phase order) 

characteristic polynomial of A at point z, namely det(A — zl) 

empirical measure built from the eigenvalues of A 

empirical measure built from the singular values of A 

logarithmic potential of jx at point z 

Cauchy-Stieltjes transform of /i at point z 

quaternionic transform of ^ at point q G IHI+ 

quaternionic transform of /ia at point q G IHI+ (i.e. Mj^^(g)) 

vector space spanned by the arguments ■ • • 

Scalar product in R" or in C" 

2-norm distance of vector v to vector space V 

orthogonal vector space of the vector space V 

support (for measures, functions, and vectors) 

module of z and cardinal of E 

2-norm of vector v in R" or in C" 

operator norm of matrix A for the 2-norm (i.e. spectral norm) 

Hilbert-Schmidt norm of matrix A (i.e. Schur or Frobenius norm) 

classical Landau notations for asymptotic behavior 

most of the time, diagonal matrix 

most of the time, unitary matrices 

most of the time, Hermitian matrix 

most of the time, random matrix with i.i.d. entries 

most of the time, complex Ginibre Gaussian random matrix 

indicator of set E 

differential operators \{dx — idy), \(dx + idy), di + dy on R^ 
set of probabilitj' measures on C integrating log |-| at infinity 
set of Schwartz-Sobolev distributions on C = R^ 
most of the time, positive constants (sometimes absolute) 
the random variable X follows the law (i 

the sequence (/Xn)^ tends weakly to ji for continuous bounded functions 
Gaussian law with mean vector m and covariance matrix E 
quarter circular law on [0, k] and circular law on {2 G C : \z\ < k} 



Table 1. Main frequently used notations. 



3.1.2]. Namely, denoting Qn,i the Grassmannian of all i-dimensional subspaces, we have 
sAA) = max min IIAxlln = max min iAx,y). 

EG0„,i xeE E,Fegn,i {x,y)eExF 

!l^ll2 = l ||,;||, = ||j/||2 = l 

Most useful properties of the singular values are consequences of their Hermitian nature 
via these variational formulas, which are valid on M" and on C". In contrast, there are no 
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such variational formulas for the eigenvalues in great generality, beyond the case of normal 
matrices. If the matrix A is normal (i.e. A* A = A* A) then for every 1 < i < n, 

s^{A) = \K{A)\. 

Beyond normal matrices, the relationships between the eigenvalues and the singular values 
are captured by a set of inequalities due to Weyl |154j^ . which can be obtained by using 
the Schur unitary triangularizatiord, see for instance |82^ Theorem 3.3.2 page 171]. 

Theorem 1.1 (Weyl inequalities). For every n x n complex matrix A and 1 < k < n, 

k k 

niA.(^)i<n.,(^). (1.1) 

i=l i=l 

The reversed form ]Xi=n-k+i ^ii^) — TVi=n-k+i every 1 < k < n can be 

deduced without much difficulty (exercise!). Equality is achieved for k = n and we have 

n n 

n |Afc(^)| = I det(^)| = V|det(A)||det(A*)| = | det(VZ4^)| = J] Sk{A). (1.2) 

k=l k=l 

By using majorization techniques which can be found for instance in [82\ Section 3.3], 
one may deduce from Weyl's inequalities that for every real valued function ip such that 
1 1—)- (/3(e*) is increasing and convex on [s„(^), si(A)], we have, for every 1 < k < n, 

k k 

j;V'(|A.(.4)|)< j;^(..(^)), (1.3) 

i=l i=l 

see [82\ Theorem 3.3.13]. In particular, taking k = n and ip{t) = t^ gives 

n n n 

Since si(-) = |Hl2_s.2 have for any A,B £ A^„(C) that 

si{AB) < si{A)si{B) and si{A + B) < si{A) + si{B). (1.5) 
We define the empirical eigenvalues and singular values measures by 

^ n ^ n 

^^'■= -Y.^>^k{A) and i^A--= -Y,6s,^A)■ 
k=l k=l 

Note that fiA and va are supported respectively in C and R+. There is a rigid determi- 
nantal relationship between fiA and va, namely from ()1.2p we get 



r 1 " 

/log|A|(i/iA(A) = -^log|A,(A)| 

i=i 



-log|det(A)| 
n 

n 
i=l 



log{s)diyAis). 



^In these notes, the word normal is always used in this way, and never as a synonym for Gaussian. 

^Horn ^D] showed a remarkable converse to Weyl's theorem: if a sequence si > • • • > s„ of non-negative 
real numbers and a sequence Ai , . . . , A„ of complex numbers of non increasing modulus satisfy to all Weyl's 
inequalities then there exists A £ A^„(C) with eigenvalues Ai, . . . , An and singular va.lu6s si, . . . , Sn- 

^If A G A^n(C) then there exists a unitary matrix U such that UAU* is upper triangular. 
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This identity is at the heart of the Hermitization technique used in sections H] and [6l 

The singular values are quite regular functions of the matrix entries. For instance, the 
Courant-Fischer formulas imply that the mapping A i— t- {si{A), . . . ,s„(A)) is 1-Lipschitz 
for the operator norm and the norm in the sense that for any A,B^ A^„(C), 

max \s^{A) - Si{B)\ < si{A- B). (1.6) 

l<i<n 

Recah that Mn{C) or Mn(M) are Hilbert spaces for the scalar product A - B = Tr{AB*). 
The norm ||-||2 associated to this scalar product, called the trace nor satisfies to 



n „ 

A\\l = Tr{AA*) = Si{Af = n dvA{s). 
i=i •' 



(1.7) 



The Hoffman- Wielandt inequality for the singular values states that for all A,B G A^„,(C), 

n 

Y^{sM)-Si{B)f<\\A-B\\l. (1.8) 



i=l 



In other words the mapping A i— )• {si{A), . . . , Sn{A)) is 1-Lipschitz for the trace norm and 
the Euclidean 2-norm. See [82l equation (3.3.32)] and [811 Theorem 6.3.5] for a proof. 

In the sequel, we say that a sequence of (possibly signed) measures (?7n)„>i on C (re- 
spectively on R) tends weakly to a (possibly signed) measure rj, and we denote 

when for all continuous and bounded function / : C — )• M (respectively / : M — )• M), 

lim I f dr]n= if drj. 

This type of convergence does not capture the behavior of the support and of the moment^. 

Example 1.2 (Spectra of non-normal matrices). The eigenvalues depend continuously on 
the entries of the matrix. It turns out that for non-normal matrices, the eigenvalues are 
more sensitive to perturbations than the singular values. Among non-normal matrices, 
we find non-diagonalizable matrices, including nilpotent matrices. Let us recall a striking 
example taken from [138j and [11, Chapter 10]. Let us consider A, B G 7W„(M) given by 



A 



/O 1 

1 



\0 



0\ 


1 

0/ 



and B 



/O 







1 

1 









1 

0/ 



where (k„) is a sequence of positive real numbers. The matrix A is nilpotent, and B is a 
perturbation with small norm (and rank one!): 

rank(^ — B) = 1 and \\A — -B||2_s>2 ~ '^n- 

We have Ai(yl) = • • • = \^^[A) = and thus 

IJ.A = So- 

In contrast, B^ = KnI and thus Xk{B) = t^^e^^'"'^/^ for all 1 < k < n which gives 

/ifi Uniformjz S C : \z\ =1} 



^Also known as the Hilbert-Schmidt norm, the Schur norm, or the Frobenius norm. 
^Note that for empirical spectral distributions in random matrix theory, most of the time the limit is 
characterized by its moments, and this allows to deduce weak convergence from moments convergence. 
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as soon as Hn — ^ 1 (this allows Kn — )• Oj. On the other hand, from the identities 

AA* =diag(l,...,l,0) and BB* = diag{l, . . . ,1, k^) 

we get si{A) = ■ ■ ■ = s„_i(^) = 1, s„(^) = and si{B) = ■ ■ ■ = Sn^i{B) = 1, Sn{B) = k„ 
for large enough n, and therefore, for any choice of Kn, since the atom k„ has weight 1/n, 

VA ~^ md vb 6i. 

This example shows the stability of the limiting distribution of singular values under an 
additive perturbation of rank 1 of arbitrary large norm, and the instability of the limiting 
eigenvalues distribution under an additive perturbation of rank 1 of arbitrary small norm. 

Beyond square matrices, one may define the singular values si{A), . . . , Sm{A) of a rect- 
angular matrix A S A^m,n(C) with m < n by Si{A) := AA*) for every 1 < i < m. 
The famous Singular Value Decomposition (SVD, see [82l Th. 3.3.1]) states then that 

A = UDV* 

where U and V are the unitary matrices of the eigenvectors of A A* and A* A and where 
D = diag(si(A), . . . , Sn{A)) is a m x n diagonal matrix. The SVD is at the heart of many 
numerical techniques in concrete applied mathematics (pseudo-inversion, regularization, 
low dimensional approximation, principal component analysis, etc). Note that if A is 
square then the Hermitian matrix H := VDV* and the unitary matrix W := UV* form 
the polar decomposition A = WH of A. Note also that if Wi and W2 are unitary matrices 
then W1AW2 and A have the same singular values. 

We refer to the books [82] and [65] for more details on basic properties of the singu- 
lar values and eigenvalues of deterministic matrices. The sensitivity of the spectrum to 
perturbations of small norm is captured by the notion of pseudo-spectrum. Namely, for a 
matrix norm ||-|| and a positive real e, the (||-||, e)-pseudo-spectrum of A is defined by 

^iMi,^(^) ^= U MB),...,Xnm- 

\\A-B\\<e 

If A is normal then its pseudo-spectrum for the operator norm |Hl2_j.2 coincides with the 
e-neighborhood of its spectrum. The pseudo-spectrum can be much larger for non-normal 
matrices. For instance, if A is the nilpotent matrix considered earlier, then the asymptotic 
(as n — 00) pseudo-spectrum for the operator norm contains the unit disc if Kn is well 
chosen. For more, see the book |151) . 

2. Quarter circular and circular laws 

The variance of a random variable Z on C is Var(Z) = E(|Z|2) - |E(Z)|2. Let (Xij). 
be an infinite table of i.i.d. random variables on C with variance 1. We consider the square 
random matrix X := (Xjj)i<jj<„ as a random variable in A4„(C). We write a.s., a. a., and 
a.e. for almost surely, Lebesgue almost all, and Lebesgue almost everywhere respectively. 

We start with a reformulation in terms of singular values of the classical Marchenko- 
Pastur theorem for the "empirical covariance matrix" -XX*. This theorem is universal 
in the sense that the limiting distribution does not depend on the law of Xn. 

Theorem 2.1 (Marchenko-Pastur quarter circular law), a.s. Vn-^l'^x '^0-2 as n ^ 00, 
where Q2 is the quarter circular lav^ on [0, 2] C with density x 1— t- Tr^^\/4 — x^l[o,2](^)- 



Actually, it is a quarter ellipse rather than a quarter circle, due to the normalizing factor I/tt. However, 
one may use difTerent scales on the horizontal and vertical axes to see a true quarter circle, as in figure [2] 
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The n normalization factor is easily understood from the law of large numbers: 
J s^dv^^.i.^{s) = ^,j2s,{Xf = ^,Tt{XX*) = |X,,f ^ E(|Xi,i|2). (2.1) 

i=l *ii=l 

The central subject of these notes is the following counterpart for the eigenvalues. 

Theorem 2.2 (Girko circular law), a.s. fi^-i/2x ^ Ci as n ^ oo, where Ci is the circular 
laii^ which is the uniform law on the unit disc of C with density z i— )• t^~^'^{z^'C:\z\<\]- 

Note that if Z is a complex random variable following the uniform law on the unit disc 
{z G C : l^l < 1} then the random variables d\t{Z) and 3m{Z) follow the semi circular 
law on [—1,1], but are not independent. Additionally, the random variables |lHe(Z)| and 
|Jm(Z)| follow the quarter circular law on [0,1], and |Z| follows the law with density 
p I—)- |pl[o,i](p)- We will see in section [5] that the notion of freeness developed in free 
probability is the key to understand these relationships. An extension of theorem 12.11 is 
the key to deduce theorem 12.21 via a Hermitization technique, as we will see in sectional 

The circular law theorem 12.21 has a long history. It was established through a sequence 
of partial results during the period 1965-2009, the general case being finally obtained by 
Tao and Vu |150j . Indeed Mehta |113j was the first to obtain a circular law theorem for 
the expected empirical spectral distribution in the complex Gaussian case, by using the 
explicit formula for the spectrum due to Ginibre [53j. Edelman was able to prove the 
same kind of result for the far more delicate real Gaussian case [H]. Silverstein provided 
an argument to pass from the expected to the almost sure convergence in the complex 
Gaussian case [84j . Girko worked on the universal version and came with very good ideas 
such as the Hermitization technique |5ll Ep [58l [59l [60] . Unfortunately, his work was con- 
troversial due to a lack of clarity and rigoi|j. In particular, his approach relies implicitly on 
an unproved uniform integrability related to the behavior of the smallest singular values 
of random matrices. Let us mention that the Hermitization technique is also present in 
the work of Widom |155j on Toeplitz matrices and in the work of Goldsheid and Kho- 
ruzhenko ^3]. Bai [10] was the first to circumvent the problem in the approach of Girko, 
at the price of bounded density assumptions and moments assumptionqj. Bai improved 
his approach in his book written with Silverstein [11]. His approach involves the control of 
the speed of convergence of the singular values distribution. Sniady considered a universal 
version beyond random matrices and the circular law, using the notion of *-moments and 
Brown measure of operators in free probability, and a regularization by adding an inde- 
pendent Gaussian Ginibre noise [138] . Goldsheid and Khoruzhenko [M] used successfully 
the logarithmic potential to derive the analogue of the circular law theorem for random 
non-Hermitian tridiagonal matrices. The smallest singular value of random matrices was 
the subject of an impressive activity culminating with the works of Tao and Vu [145] 
and of Rudelson and Vershynin [128] , using tools from asymptotic geometric analysis and 
additive combinatorics (Littlewood-Offord problems). These achievements allowed Gotze 
and Tikhomirov [66j to obtain the expected circular law theorem up to a small loss in the 
moment assumption, by using the logarithmic potential. Similar ingredients are present 
in the work of Pan and Zhou [116] . At the same time, Tao and Vu, using a refined bound 
on the smallest singular value and the approach of Bai, deduced the circular law theorem 
up to a small loss in the moment assumption |146] . As in the works of Girko, Bai and 
their followers, the loss was due to a sub-optimal usage of the Hermitization approach. 

It is not customary to call it instead the "disc law" . The terminology corresponds to what we actually 
draw: a circle for the circular law, a quarter circle (actually a quarter ellipse) for the quarter circular law, 
even if it is the boundary of the support in the first case, and the density in the second case. See figure [21 
^Girko's writing style is also quite original, see for instance the recent paper |61j . 

^. . . I worked for IS years from 1984 1997, which was eventually published in Annals of Probability. 
It was the hardest problem I have ever worked on. Zhidong Bai, interview with Atanu Biswas in 2006 [36) . 
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In [150] , Tao and Vu finally obtained the full circular law theorem 12.21 by using the full 
strength of the logarithmic potential, and a new control of the count of the small singular 
values which replaces the speed of convergence estimates of Bai. See also their synthetic 
paper [147]. We will follow essentially their approach in section 2] to prove theorem 12.21 




-1 -0.5 0.5 1 -1 -0.5 0.5 



Figure 2. Illustration of universality in the quarter circular law and the 
circular law theorems 12.11 and 12.21 The plots are made with the singu- 
lar values (upper plots) and eigenvalues (lower plot) for a single random 
matrix X of dimension n = 1000. On the left hand side, Xn follows a 
standard Gaussian law on M, while on the right hand side Xn follows a 
symmetric Bernoulli law on {—1, 1}. Since X has real entries, the spec- 
trum is symmetric with respect to the real axis. A striking fact behind such 
simulations for the eigenvalues (lower plots) is the remarkable stability of 
the numerical algorithms for the eigenvalues despite the sensitivity of the 
spectrum of non-normal matrices. Is it the Sniady regularization of Brown 
measure theorem |138] at work due to floating point approximate numerics? 

The a.s. tightness of fi^-i/2x is easily understood since by Weyl's inequality we obtain 

/ |A|2 dMn-V2x(A) = A E ^ ^ E W = dl^n-^/^xis)- 

^ t=l " i=l 

The convergence in the couple of theorems above is the weak convergence of probability 
measures with respect to continuous bounded functions. We recall that this mode of 
convergence does not capture the convergence of the support. It implies only that a.s. 

lim si{n-^/^X) > 2 and lim |Ai(n"^/2x)| > 1. 
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However, following [Il[l2l[IT] and [l3l[Iig, if E(Xi,i) = and E(|Xi,i|^) < oo then a.sE] 
lim Si(n"^/2X) = 2 and lim | Ai(n"^/2j^)| = 1. 

The asymptotic factor 2 between the operator norm and the spectral radius indicates in 
a sense that X is a non-normal matrix asymptotically as n — )• oo (note that if Xn is 
absolutely continuous then X is absolutely continuous and thus XX* ^ X*X a.s. which 
means that X is non-normal a.s.). The law of the modulus under the circular law has 
density p i— )■ 2pl[o,i](p) which differs completely from the shape of the quarter circular law 
s I— 7- 7r~-^-v/4 — l[o^2]('5)) see figure [3l The integral of "log" for both laws is the same. 





ModuleCircleLaw 




QuarterCircle 











0.5 1 1.5 2 



Figure 3. Comparison between the quarter circular distribution of theo- 
rem [21T] for the singular values, and the modulus under the circular law of 
theorem 12.21 for the eigenvalues. The supports and the shapes are different. 
This difference indicates the asymptotic non-normality of these matrices. 
The integral of the function 1 1— )• log(t) is the same for both distributions. 



3. Gaussian case 

This section is devoted to the case where Xn ~ ^f{0, ^h)- From now on, we denote G 
instead of X in order to distinguish the Gaussian case from the general case. We say that 
G belongs to the Complex Ginibre Ensemble. The Lebesgue density of the n x n random 
matrix G = (G',j)i<ij<„ in 7W„(C) = is 

where A* the conjugate-transpose of A. This law is a Boltzmann distribution with energy 

n n 
i,j=l i=l 

This law is unitary invariant, in the sense that if U and V are n x n unitary matrices 
then UGV and G are equally distributed. If Hi and H2 are two independent copies of 



-"^•^The argument is based on Gelfand's formula: if A G M„{C) then |Ai(A)| = limfc-^oo for 
any norm ||'|| on A4n{C) (recall that all norms are equivalent in finite dimension). In the same spirit, the 
Yamamoto theorem states that lim^^oo Si(j4'°)^'''" = |Ai(^)| for every 1 < i < n, see [821 Theorem 3.3.21]. 
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GUeEI then {Hi + ii^a) /V2 has the law of G. Conversely, the matrices {G + G*)/V2 and 
{G — G*)/\/2i are independent and belong to the GUE. 

The singular values of G are the square root of the eigenvalues of the positive semidefinite 
Hermitian matrix GG* . The matrix GG* is a complex Wishart matrix, and belongs to 
the complex Laguerre Ensemble (/3 = 2). The empirical distribution of the singular values 
of n~^/^G tends to the Marchenko-Pastur quarter circular distribution (Gaussian case in 
theorem 12. This section is rather devoted to the study of the eigenvalues of G, and in 
particular to the proof of the circular law theorem 12.21 in this Gaussian settings. 

Lemma 3.1 (Diagonalizability) . For every n > 1, the set of elements of A^„,(C) with 
multiple eigenvalues has zero Lebesgue measure in C"^". In particular, the set of non- 
diagonalizable elements o/A^„,(C) has zero Lebesgue measure in C"^". 

Proof. If ^ G A^„(C) has characteristic polynomial 

Pa(^) = ^" + a„_iz"-i + --- + ao, 

then uq, . . . , an-i are polynomials of the entries of A. The resultant R{Pa, P^) of Pa, P'a, 
called the discriminant of Pa, is the determinant of the (2n — 1) x (2n — 1) Sylvester matrix 
of Pa, P'a- It is a polynomial in oq, . . . , a^-i. We have also the Vandermonde formula 

\R{Pa,P'a)\=\{\HA)-\,{A)\''- 

i<j 

Consequently, A has all eigenvalues distinct if and only if A lies outside the proper poly- 
nomial hyper-surface {A £ C"^" : R{Pa, Pa) = 0}. □ 

Since G is absolutely continuous, we have a.s. GG* ^ G*G (non-normality). Addi- 
tionally, lemma [3T] gives that a.s. G is diagonalizable with distinct eigenvalues. Following 
Ginibre |53j - see also }1141I491 Chapter 15] and [96j - one may then compute the joint den- 
sity of the eigenvalues Xi{G), . . . , A„(G) of G by integrating (j3.ip over the non-eigenvalues 
variables. The result is stated in theorem 13.21 below. It is worthwhile to mention that 
in contrast with Hermitian unitary invariant ensembles, the computation of the spectrum 
law is problematic if one replaces the square potential by a more general potential, see 
[96]. The law of G is invariant by the multiplication of the entries with a common phase, 
and thus the law of the spectrum of G has also the same property. In the sequel we set 

A„ := {{zi,...,Zn) G C" : |zi| > ••• > \zn\}. 
Theorem 3.2 (Spectrum law). (Ai(G), . . . , A„(G)) has density nlipnlA^ where 

ipnizi, ...,Zn)= . . . ^, exp - ^ |zfcP JJ l^i-^f. 

\ k=l J l<i<j<n 

In particular, for every symmetric Borel function F : C" — )■ M, 

E[F(Ai(G),... ,A„(G))] = / F{zi,...,Zn)(pn{zi,...,Zn)dzi---dzn- 
We will use theorem 13.21 with symmetric functions of the form 

F{zi,...,Zn)= ^ f{Zi,)--- f{zij. 

il,...,ik distinct 

The Vandermonde determinant comes from the Jacobian of the diagonalization, and can 
be interpreted as an electrostatic repulsion. The spectrum is a Gaussian determinantal 
point process, see [83l Chapter 4]. 

"'^^Up to scaling, a random nxn Hermitian matrix H belongs to the Gaussian Unitary Ensemble (GUE) 
when its density is proportional to i-> exp(— |Tr(ff^)) — exp( — | — X]i<i<j<n 1^)- 

Equivalently {Ha, Hij 1 < i < n, i < j < n} are indep. and Ha ~ A/'(0, 1) and Hij A/'(0, I/2) for i 7^ j. 
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Theorem 3.3 (/c-points correlations). Let z G C i— >• 7(2) = vr^-'^e^l^l^ be the density of the 
standard Gaussian Af{0, ^12) on C. Then for every 1 < k < n, the "k-point correlation" 

fn.k{zi,...,Zk) := / ipn{zi,...,Zk)dZk+l---dZn 

JCn-k 

satisfies 

<^n,k{^i, ■■■,Zk) = — —^^ — -T^''' 7(^1) • • • lizk) det [K{zi, ^•)]i<ij<fc 

where 

K{zi, zj) := = E He{zi)He{zjr with H,{z) := —/ . 

In particular, by taking k = n we get 

1 _ 2 

Vn,n{^l, ...,Zn)= iPn{zi, . . . , Z„) = —IT " 7(^1) • • • 7(z„) det [iir(zi, ^j)]i<ij<„- 

Proof. Calculations made by [1141 Chapter 15 page 271 equation 15.1.29] using 

JJ \Zi-Zj\'^= Yl (Zi-Zj) Yl i^i-^jT 
l<i<j<n l<*<i<»^ 1<'<J<'^ 

and 



det 



i<..</^* [(^l)^"]i<M<n = ^det[i^(z.,.,)],<^^^.<„. 



□ 



Recall that if // is a random probability measure on C then E/x is the deterministic 
probability measure defined for every bounded measurable / by 



Theorem 3.4 (Mean circular Law). E/i^_i/2Q Ci as n ^ 00, where C\ is the circular 
law i.e. the uniform law on the unit disc of C with density z 1— t- TT~^'i-{zeC:\z\<i}- 

Proof. From theorem 13.31 with k = 1, we get that the density of E/ug is 



n ^ — ' / nir 



Following Mehta |1141 Chapter 15 page 272], some calculus gives for every compact C C C 

lim sup \ncpn,iiVnz) - vr^^lro ii(|2;|)] = 0. 
zee L . J I 

The n in front of ipn,i is due to the fact that we are on the complex plane C = and 
thus dy/nxdy/ny = ndxdy. Here is the start of the elementary calculus: for < n, 



e 



while for r > n, 



'^^2£ r'^{n-l) 1^ / ^ _ ^2(n~l) ^2 



^ i] - (n-l)! ^ 

^2 _ I /Z7^|2 



< 



(n — 1)! — n + 1 



By taking r = |-(/n2:| we obtain the convergence of the density uniformly on compact 
subsets, which implies in particular the weak convergence. □ 
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The sequence {Hk)k£'N forms an orthonormal basis (orthogonal polynomials) of square 
integrable analytic functions on C for the standard Gaussian on C. The uniform law on 
the unit disc (known as the circular law) is the law of VV e^™'^ where V and W are 
i.i.d. uniform random variables on the interval [0,1]. This point of view can be used to 
interpolate between complex Ginibre and GUE via Girko's elliptic laws, see [1001 190[ I19j. 

We are ready to prove the complex Gaussian version of the circular law theorem 12.21 



Theorem 3.5 (Circular law), a.s. f^n-^/^G Ci as n ^ oo, where Ci is the circular law 
i.e. the uniform law on the unit disc of C with density z i— )■ 'n'~^l^z^c-\z\<i}- 

Proof. We reproduce Silverstein's argument, published by Hwang [84J. The argument is 
similar to the quick proof of the strong law of large numbers for independent random 
variables with bounded fourth moment. It suffices to establish the result for compactly 
supported continuous bounded functions. Let us pick such a function / and set 

Sn-= / fdfj,^-i/2G and 5oo := vr"^ / f{z)dxdy. 
Jc •^kl<i 

Suppose for now that we have 

E[{Sn-ESnf] = 0{n-^). (3.2) 
By monotone convergence (or by the Fubini-Tonelli theorem). 



oo 



n=l n=l 

and consequently Yl'^=i i'^n — lESn)^ < oo a.s. which implies lim„_^oo Sn — = a.s. 
Since lim^^oo = Soo by theorem 13. 4|, we get that a.s. 

lim Sn = Soo- 

n— >oo 

Finally, one can swap the universal quantifiers on uj and / thanks to the separability of 
the set of compactly supported continuous bounded functions C — >■ M equipped with the 
supremum norm. To establish ()3.2p . we set 

1 " 

Sn-ESn = -Y,Z^ ^i^h := / (n-i/^c) ) . 
1=1 

Next, we obtain, with running over distinct indices in 1, . . . , n, 

E 



n 



The first three terms of the right hand side are 0(n~^) since maxi<j<„ |Zj| < 
Finally, some calculus using the expressions of (/?„,3 and ipn,4: provided by theorem 13.3 
allows to show that the remaining two terms are also 0(n~^). See Hwang [84, p. 151]. □ 
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It is worthwhile to mention that one can deduce the circular law theorem 13.51 from a 
large deviations principle, bypassing the mean circular law theorem 13. 4^ see section [5j 

Following Kostlan |97) (see also Rider |122] and [83]) the integration of the phases in 
the joint density of the spectrum given by theorem 13.21 leads to theorem 13.61 below. 

Theorem 3.6 (Layers). If Zi, . . . , Zn are independent witl^ Z| ~ T{k, 1) for all k, then 

(|Ai(G)|,...,|A„(G)|)^(Z(i),...,Z(„)) 
where Z^^^ , ■ ■ ■ , Z(^^-^ is the TioTi-iTicTccLsiTig TcovdcTing of the sequence • • • •> ^n* 

Note thalEl {y/2Zkf ~ X^(2fe) which is useful for V2G. Since Zl = Ei + ---+Ek where 
El, . . . , Ek are i.i.d. exponential random variables of unit mean, we get, for every r > 0, 

P(|Ai(G)| < V^r) = II rf ^'^"^^^" <rA 

l<k<n ^ ^ 

The law of large numbers suggests that r = 1 is a critical value. The central limit theorem 
suggests that |Ai(n~^/^G)| behaves when n S> 1 as the maximum of i.i.d. Gaussians, for 
which the fluctuations follow the Gumbel law. A quantitative central limit theorem and 
the Borel-Cantelli lemma provides the follow result. The full proof is in Rider [122J. 

Theorem 3.7 (Convergence and fluctuation of the spectral radius). 

lim |Ai(n-V2(^)| = 

71— >00 

Moreover, if '■= log(n/27r) — 21og(log(n)) then 



y/^(^\Xi{n''/^G)\-l 




where Q is the Gumbel law with cumulative distribution function x i— )• e on R. 

The convergence of the spectral radius was obtained by Mehta [1141 chapter 15 page 271 
equation 15.1.27] by integrating the joint density of the spectrum of theorem 13.21 over the 
set ni<i<n,{|Ai| > r}. The same argument is reproduced by Hwang [HH pages 149-150]. 
Let us give now an alternative derivation of theorem 13.41 From theorem 13. 7^ the sequence 
(E/i^_i/2(j)n>i is tight and every accumulation point fj, is supported in the unit disc. From 
theorem 13.21 such a /i is rotationally invariant, and from theorem 13.61 the image of /i by 
z £ C \z\ has density r i— )■ 2rl[o,i](?") (use moments!). Theorem 13.41 follows immediately. 

It is remarkable that the large eigenvalues in modulus of the complex Ginibre ensemble 
are asymptotically independent, which gives rise to a Gumbel fluctuation, in contrast with 
the GUE and its delicate Tracy- Widom fluctuation, see [90] for an interpolation. 

Remark 3.8 (Real Ginibre Ensemble). Ginibre considered also in his paper [53| the case 
where C is replaced by M or by the quaternions. These cases are less understood than the 
complex case due to their peculiarities. Let us focus on the Real Ginibre Ensemble, studied 
by Edelman and his collaborators. The expected number of real eigenvalues is equivalent to 
y2n/7r as n ^ oo, see |42J, while the probability that all the eigenvalues are real is exactly 
2-n{n-i)/4^ see [4H Corollary 7.1]. The expected counting measure of the real eigenvalues, 
scaled by yfn, tends to the uniform law on the interval [—1,1], see [i5| Theorem 4.1] and 
figures [7} The eigenvalues do not have a density in C", except if we condition on the 
real eigenvalues, see [H] . The analogue of the weak circular law theorem \3.4\ was proved 
by Edelman [421 Theorem 6.3]. More information on the structure of the Real Ginibre 
Ensemble can be found in [2], [27], and in [96j and [49 1 Chapter 15]. 



12 

-^%ere x^(»^) stands for the law of \\V\\l where V ~ 7V(0, /„ 



Here T{a, A) stands for the probabihty measure on R+ with Lebesgue density x i— > AT(a) 

2 
2 
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On overall, one can remember that the Complex Ginibre Ensemble is in a way "simpler" 
than the GUE while the Real Ginibre Ensemble is "harder" than the GOE. 



Real Ginibre > GOE > GUE > Complex Ginibre 



500 Gaussian 300x300 



500 Bernoulli 300x300 



Figure 4. Histograms of real eigenvalues of 500 i.i.d. copies of n-^l'^X 
with n = 300. The left hand side graphic corresponds to the standard real 
Gaussian case Xu ~ A/'(0, 1), while the right hand side graphic corresponds 
to the symmetric Bernoulli case Xu ~ + Si). See remark 



Remark 3.9 (Quaternionic Ginibre Ensemble). The quaternionic Ginibre Ensemble was 
considered at the origin by Ginibre [53j . It has been recently shown [18j by using the 
logarithmic potential that there exists an analogue of the circular law theorem for this 
ensemble, in which the limiting law is supported in the unit ball of the quaternions field. 

4. Universal case 

This section is devoted to the proof of the circular law theorem 12.21 following [150J . The 
universal Marchenko-Pastur theorem 12.11 can be proved by using powerful Hermitian tech- 
niques such as truncation, centralization, the method of moments, or the Cauchy-Stieltjes 
trace-resolvent transform. It turns out that all these techniques fail for the eigenvalues of 
non-normal random matrices. Indeed, the key to prove the circular law theorem 12.21 is to 
use a bridge pulling back the problem to the Hermitian world. This is called Hermitization. 

Actually, and as we will see in sections [5] and El there is a non-Hermitian analogue of the 
method of moments called the *-moments, and there is an analogue of the Cauchy-Stieltjes 
trace-resolvent in which the complex variable is replaced by a quaternionic type variable. 

4.1. Logarithmic potential and Hermitization. Let V{C) be the set of probability 
measures on C which integrate log|-| in a neighborhood of infinity. The logarithmic po- 
tential Ufj_ fj, G V{C) is the function [/^ : C — )• (— oo, +oo] defined for all z G C by 



U^{z) = - \og\z - \\dii{X) = -{\og\-\* ^l){z). (4.1) 
Jc 

For instance, for the circular law Ci of density T^^^'^{ze€:\z\<i}7 we have, for every z G C, 



''''^''^ = \\ix-\z?) ifN<i, ^'-'^ 



2 ' 
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see e.g. [130] . Let P'(C) be the set of Schwartz-Sobolev distributions. We have V{C) C 
P'(C). Since log |-| is Lebesgue locally integrable on C, the Fubini-Tonelli theorem implies 
that is Lebesgue locally integrable on C. In particular, < oo a.e. and G P'(C). 
Let us define the first order linear differential operators in D'(C) 

d-=^{dx-idy) and d := ^{d^ + idy) (4.3) 

and the Laplace operator A = Add = Add = d"^ + dy. Each of these operators coincide 
on smooth functions with the usual differential operator acting on smooth functions. By 
using Green's or Stockes' theorems, one may show, for instance via the Cauchy-Pompeiu 
formula, that for any smooth and compactly supported function 93 : C — )• M, 

- / Alp{z) log \z\ dxdy = 2'Kip{<d) (4.4) 

where z = x + iy. Now ()4.4p can be written, in P'(C), 

A log I -I = 2tt5o 

In other words, ^ log |-| is the fundamental solution of the Laplace equation on M?. Note 
that log I • I is harmonic on C \ {0} . It follows that in V (C) , 

AC/^, = -27r^. (4.5) 

This means that for every smooth and compactly supported "test function" 99 : C — )• R, 



(AC/^,99)^,(Q = - / Aip{z)Uf,{z) dxdy = -2tt / ip{z) dfi{z) = -(27r/i, (4.6) 

where z = x + iy. This means that —-^U. is the Green operator on (Laplacian inverse). 

Lemma 4.1 (Unicity). For every /x, G ^{C), ifU^ = Uy a.e. then fi = v. 

Proof. Since C/^ = Uu in V'{C), we get AU^ = AUu in ^'(C). Now (g^]) gives = v 
in 2?'(C), and thus /i = 1/ as measures since /x and v are Radon measures. Note more 
generally that the lemma remains valid = + h for some harmonic h £ D'{C). □ 

If ^4 is a n X n complex matrix and Pa{z) := det{A — zl) is its characteristic polynomial, 

U^Az) = - [ log\\ - z\di,A{\) = --log\det{A - zl)\ = --log\PAiz)\ 
Jc n n 

for every z G C \ {Ai(yl), . . . , A„(^)}. We have also the alternative expressior0 

1 r°° 

U,^A^) = — logdet(V(^ - zI){A - zI)*) = - / \og{t)duA~zi{t). (4.7) 

Jo 

One may retain from this determinantal Hermitization that for any A G A^„(C), 



knowledge of i'a-zI for a. a. 2; G C =^ knowledge of fj,A 



Note that from (|4.5p . for every smooth compactly supported function 99 : C — ?■ R, 

2tt J fdfiA = J (^95) log I -Pa I dxdy. 

The identity (j4.7|) bridges the eigenvalues with the singular values, and is at the heart of 
the next lemma, which allows to deduce the convergence of fiA from the one of va-zI- The 
strength of this Hermitization lies in the fact that contrary to the eigenvalues, one can 
control the singular values with the entries of the matrix using powerful methods such as 
the method of moments or the trace-resolvent Cauchy-Stieltjes transform. The price paid 
here is the introduction of the auxiliary variable z. Moreover, we cannot simply deduce 
the convergence of the integral from the weak convergence of i^a-zI since the logarithm 



-'^^Girko uses the name "IZ-transform of ha" , where V stands for "Victory" 
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is unbounded on M_(.. We circumvent this problem by requiring uniform integrability. We 
recall that on a Borel measurable space {E,£), a Borel function /:£'—)■ R is uniformly 
integrable for a sequence of probability measures (r/n)n>i on E when 



lim sup / I/I drjn = 0. 

*->oOn>l JUf\>t\ 



t^oOn>l J{\f\>t} 

We will use this property as follows: if as n — )• oo for some probability measure i] 

and if / is continuous and uniformly integrable for {r]n)n>i then / is //-integrable and 

lim I f dr]n= if di]. 

Remark 4.2 (Weak convergence and uniform integrability in probability). Let T be a 
topological space such as M or C, and its Borel a-field T . Let {r]n)n>i ^ sequence of 
random probability measures on {T, T) and r] be a probability measure on (T, T) . We say 
that r]n r] in probability if for all bounded continuous / : T — )• M and any e > 0, 



fdrjn- f dri 



> e = 0. 



Hm P 

n— >oo 

This is implied by the a.s. weak convergence. We say that a measurable function f : T 
is uniformly integrable in probability for (f?ra)„>i when for any e > 0, 




lim supP / \f\dr]n > e] =0. 

t^°°n>l \J\f\>t J 

We will use this property as follows: ifrjn rj in probability and if f is uniformly integrable 
for (?7n)n>i probability then f is rj-integrable and J f dr]n converges in probability to 
ffdr]. This will be helpful in section \^ together with lemma \J7S\ in order to circumvent 
the lack of almost sure bounds on small singular values for heavy tailed random matrices. 

The idea of using Hermitization goes back at least to Girko |54j . However, the proofs 
of lemmas 14.31 and 14.51 below are inspired from the approach of Tao and Vu |150j . 

Lemma 4.3 (Hermitization). Let {An)n>i be a sequence of complex random matrices 
where An is n x n for every n > 1. Suppose that there exists a family of (non-random) 
probability measures {i^z)z£C on such that, for a. a. z £ C, a.s. 

(i) iyA„-zi -^Vz as n ^ oo 

(ii) log is uniformly integrable for {vAn-zl)n>i- 

Then there exists a probability measure /i G V{C) such that 
(j) a.s. fj,A„ /U as n oo 
(jj) for a.a. z € C, 

poo 

U^{z) = - I log{s)dL'zis). 







Moreover, if the convergence (i) and the uniform integrability (ii) both hold in probabil- 
ity for a.a. z G C (instead of for a.a. z G C, a.s.), then (j-jj) hold with the a.s. weak 
convergence in (j) replaced by the weak convergence in probability. 



Proof of lemma \4.'^ Let us give the proof of the a.s. part. We first observe that one 
can swap the quantifiers "a.a." on z and "a.s." on u in front of (i-ii). Namely, let us 
call P{z,uj) the property "the function log is uniformly integrable for {i^a„{u])-zi) ^-^i 
^a„{lo)-zI ^ ^2" ■ The assumptions of the lemma provide a measurable Lebesgue negligible 
set C in C such that for all 2; C there exists a probability one event Ez such that for all 
oj £ Ez, the property P{z, uj) is true. From the Fubini-Tonelli theorem, this is equivalent to 
the existence of a probability one event E such that for all u G E, there exists a Lebesgue 
negligible measurable set C^i in C such that for all z ^ C^^, the property P{z,uj) is true. 
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Prom now on, we fix an arbitrary u £ E. For every z C^j, let us define tlie probability 
measure i' := i^z and the triangular arrays (an,A:)i<fc<„ and (^n,A:)i<fc<„ by 

an,k ■= \>^k{An{uj) - zl)\ and bn,k ■= Sk{Aniu)) - zl). 

Note that tJ'An{uj)-zi = /^A„(aj) * (^-z- Thanks to the Weyl inequalities (jl.ip and to the 
assumptions (i-ii), one can use lemma H3] below, which gives that (^A„(a;))rt>i is tight, 
that for a.a. z £ C, log 1^ — -j is uniformly integrable for (Aiyi„{w))n>i, and that 

POO 

1™ ^/^a„m(^) = - / log(s)di/^(s) = U{z). 







Consequently, if the sequence (Myl„(a;))„>x admits two probability measures and /uj^ 
as accumulation points for the weak convergence, then both fi^ and fi'^ belong to V{C) 
and Ufj^^ = U = U^i^ a.e., which gives = ^'^ thanks to lemma WA\ Therefore, the 
sequence (^A„{a;))^>^ admits at most one accumulation point for the weak convergence. 
Since the sequence (/iA„(aj))^>]^ is tight, the Prohorov theorem implies that (^a„{lj))„>;^ 
converges weakly to some probability measure G V{C) such that C/^^ = U a.e. Since U 
is deterministic, it follows that u ^ \s deterministic by lemma HT] again . This achieves 
the proof of the a.s. part of the lemma. The proof of the "in probability" part of the 
lemma follows the same lines, using this time the "in probability" part of lemma H31 □ 

Remark 4.4 (Weakening uniform integrability in lemma The set of z inC such that 
z is an atom of IE^a„ for some n > 1 is at most countable, and has thus zero Lebesgue 
measure. Hence, for a.a. z G C, a.s. for all n > 1, z is not an eigenvalue of An. Thus for 
a.a. z G C, a.s. for all n > 1, 



log(s) dvA„-zi{s) < oo. 
Hence, assumption (ii) in the a.s. part of lemma \4-^ holds if for a.a. z & C, a.s. 



lim lim / \f\ duAn-zl{s) = 

where f = log. Similarly, regarding "in probability" part of lemma one can replace 
the sup by lim in the definition of uniform integrability in probability. 

The following lemma is in a way the skeleton of proof of lemma 14.31 (no matrices). 
It states essentially a propagation of a uniform logarithmic integrability for a couple of 
triangular arrays, provided that a logarithmic majorization holds between the arrays. 

Lemma 4.5 (Logarithmic majorization and uniform integrability). Let {an,k)i<k<n oirid 
{bn,k)i<k<n be two triangular arrays in ]R_|-. Define the discrete probability measures 

k=l k=l 

If the following properties hold 

(i) On.i > • • • > an,n and > • • • > 6„,„ for n > 1, 

(ii) HiLi ^ Ili=i bn,i for every l<A;<n/orn>l, 

(iii) n"=fc bn,i < nr=fe f^r e^^ery I < k < n for n :§> 1, 

(iv) 1/ as n ^ oo for some probability measure v, 

(v) log is uniformly integrable for {i'n)n>i, 
then 

(j) log is uniformly integrable for {fin)n>i (in particular, (/^n)„>i is tight). 
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(jj) we have, as n ^ oo, 

log{t) dfinit) = / log{t) dl^n{t) ^ / log{t)diy{t), 

Jo Jo 
and in particular, for every accumulation point /i of {nn)n>i, 

D />00 

log(t)d/i(t) = / log{t)du{t). 



Moreover, assume that {an^k)i<k<n o^'^ ibn,k)i<k<n ctre random triangular arrays in 
defined on a common probability space such that (i-ii-iii) hold a.s. and (iv-v) hold in 
probability. Then (j-jj) hold in probability. 

Proof. An elementary proof can be found in [231 Lemma C2]. Let us give an alterna- 
tive argument. Let us start with the deterministic part. From the de la Vallee Poussin 
criterion (see e.g. [371 Theorem 22]), assumption (v) is equivalent to the existence of a 
non-decreasing convex function J : — t- such that limf_^oo Ji't)/t = oo, and 

sup / J(| log(t)l) di^nit) < OO. 

n J 

On the other hand, assumption (i-ii-iii) implies that for every real valued function ip such 
that 1 1—)- (p{e^) is non-decreasing and convex, we have, for every 1 < k < n, 

k k 

'^^ian,k) < y^^V>iK,k), 

i=l i=l 

see [S21 Theorem 3.3.13]. Hence, applying this for = n and ip = J, 

sup / J{\log{t)\)dfJ.ri{t) < OO. 
n J 

We obtain by this way (j). Statement (jj) follows trivially. 

We now turn to the proof of the "in probability" part of the lemma. Arguing as in 
\37\ Theorem 22], the statement (v) of uniform convergence in probability is equivalent to 
the existence for all 5 > of a non-decreasing convex function Jg : — t- such that 
limt_^oo Js{t)/t = OO, and 

supP(^y Js{\log{t)\)dun{t) <d. 
Since is non-decreasing and convex we deduce as above 

Js{\ log(t)l) dfinit) < j Jsi\ log(t)l) dl^nit). 

This proves (j). Statement (jj) is then a consequence of remark [4.21 □ 

Remark 4.6 (Logarithmic potential and Cauchy-Stieltjes transform). We may define the 
Cauchy-Stieltjes transform : C — t- C U {oo} of a probability measure fi on C by 

"^m(^) ■= I T^(il^W- 
JC'^~ ^ 

Since 1/|-| is Lebesgue locally integrable on C, the Fubini-Tonelli theorem implies that 
m^{z) is finite for a. a. z G C, and moreover is locally Lebesgue integrable on C and 
thus belongs to P'(C). Suppose now that fj, £ 7^(C). The logarithmic potential is related 
to the Cauchy-Stieltjes transform via the identity 

= 2dU^ 

in T>'{C). In particular, since Add = 488 = A as operators on T>'{C), we obtain, in T>'{C), 

2dm^ = — AC/^ = —iTTfj.. 
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Thus we can recover /x from m^. Note that for any e > 0, is hounded on 

= {z G C : dist(2;, supp(/u)) > e). 

//supp(//) is one- dimensional then one may completely recover /j, from the knowledge ofm^ 
on Df, as e — )• 0. Note also that is analytic outside supp(/x), and is thus characterized 
by its real part or its imaginary part on arbitrary small balls in the connected components 
o/supp(/x)^. //supp(/x) is not one- dimensional then one needs the knowledge ofm^ inside 
the support to recover fi. If A€ A^„(C) then m^^ is the trace of the resolvent 

m^Az) = Tr{iA-zI)-') 

for every z G C\ {Xi{A), . . . , Xn{A)}. For non-Hermitian matrices, the lack of a Hermi- 
tization identity expressing m^^ in terms of singular values explains the advantage of the 
logarithmic potential {/^^ over the Cauchy-Stieltjes transform m^^ for spectral analysis. 

Remark 4.7 (Logarithmic potential and logarithmic energy). The term "logarithmic po- 
tential" comes from the fact that Ufj, is the electrostatic potential of jj. viewed as a distri- 
bution of charged particles in the plane C = |r3D] . The so called logarithmic energy of 
this distribution of charged particles is 



8{ii) := / U^{z) dix{z) = - / log |z - A| dfi{z)df,{X). (4.8) 
JC Jc Jc 

The circular law minimizes £{■) under a second moment constraint |13Uj . //'supp(^) C M 
then £{fJ-) matches up to a sign and an additive constant the Voiculescu free entropy for 
one variable in free probability theory [153, Proposition 4.5] (see also the formula \5.1\) . 

Remark 4.8 (From converging potentials to weak convergence). As for the Fourier trans- 
form, the pointwise convergence of logarithmic potentials along a sequence of probability 
measures implies the weak convergence of the sequence to a probability measure. We need 
however some strong tightness. More precisely, if (/Un)„>i is a sequence in 'P(C) and if 
U : C ^ {—oo, +oo] is such that 

(i) for a. a. z £ C, lim„_^oo Ufj,„{z) = U{z), 

(ii) log(l + |-|) is uniformly integrable for (/U„)„>i, 

then there exists /x G T'(C) such that = U a.e. and /i = —-^^U in D'(C) and 

Let us give a proof inspired from Proposition 1.3 and Appendix A]. From the de la 
Vallee Poussin criterion (see e.g. \67\ Theorem 22]), assumption (ii) implies that for every 
real number r > 1, there exists a non- decreasing convex function J : M+ — )• which 
may depend on r, such that lim^^^oo J{'t)/t = oo, and J{t) < 1 + t^, and 

sup / J(log(r + |A|)) dfiniX) < oo. 

n J 

Let K d C he an arbitrary compact set. We choose r = r[K) > 1 large enough so that the 
ball of radius r — 1 contains K, and therefore for every z £ K and X € C, 

J(| log |z - All) < (1 + I log \z - X\\^)l{\x\<r} + ^(log(r + |A|))1||a|>.}. 

The couple of inequalities above, together with the fact that the function (logl-l)'^ is locally 
Lebesgue integrable on C, imply, by using Jensen and Fubini-Tonelli theorems. 



sup / J {\Un{z)\) dxdy < sup Ik (z) J {\log \z — X\\) dfin{X) dxdy < oo, 

n Jk " J J 

where z = x + iy as usual. Since the de la Vallee Poussin criterion is necessary and suffi- 
cient for uniform integrability, this means that the sequence (t^/i,i)„>i ^'^ locally uniformly 
Lebesgue integrable. Consequently, from (i) it follows that U is locally Lebesgue integrable 
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and that U^^ U in 2?'(C). Since the differential operator A is continuous in P'(C), 
we find that AC/^^ — )■ AU in T>'{C). Since AC/ < 0, it follows that fi := —-^AU is a 
measure (see e.g. I79jj. Since for a sequence of measures, convergence in P'(C) implies 
weak convergence, we get Hn = —-^AU^^ = —-^AU. Moreover, by assumptions (ii) 
we get additionally that fi £ V{C). It remains to show that = U a.e. Indeed, for any 
smooth and compactly supported (/? : C — )• M, since the function log|-| is locally Lebesgue 
integrable, the Fubini-Tonelli theorem gives 

fiz)Uf,„{z) dz = - j (^j ip{z) log \z - w\ dz^ dfiniw). 

Now the function (p*log |-| : w £ C ^ f ip{z) log jz— dz is continuous and is ©(log |1 + -1). 
Therefore, by (i-ii) we get — )■ C/^ in 2?'(C) and thus = U in V'{C) and then a.e. 

4.2. Proof of the circular law. The proof of theorem 12.21 is based on the Hermitization 
lemma 1131 The part (i) of lemma is obtained from corollary 14. 101 below. 

Theorem 4.9 (Convergence of singular values with additive perturbation). Let (M„)„>i 
be a deterministic sequence such that Mn S A4„(C) for every n. If vm„ p as n ^ co 
for some probability measure p on M_|_ then there exists a probability measure Vp on 
which depends only on p and such that a.s. z^„-i/2x+m„ as n —)• oo. 

Theorem 14 . 91 app ear s as a special case of the work of Dozier and Silverstein for informa- 
tion plus noise random matrices j39j . Their proof relies on powerful Hermitian techniques 
such as truncation, centralization, trace-resolvent recursion via Schur block inversion, lead- 
ing to a fixed point equation for the Cauchy-Stieltjes transform of Vp. It is important to 
stress that Vp does not depend on the law of Xn (recall that Xn has unit variance). One 
may possibly produce an alternative proof of theorem 14.91 using free probability theory. 

Corollary 4.10 (Convergence of singular values). For all z G C, there exists a probability 
measure Vz depending only on z such that a.s. i^n-^/^x-zi ^ as n ^ oo. 

For completeness, we will give in sub-section 14.51 a proof of corollary 14.101 Note also 
that for z = 0, we recover the quarter circular Marchenko-Pastur theorem 12.11 

It remains to check the uniform integrability assumption (ii) of lemma 14.31 From 
Markov's inequality, it suffices to show that for all z G C, there exists p > such that a.s. 

lim / s~^(iz/ 1/2 Y_ 7-(s) < oo and lim / du i/2x-zli^) < i^-^) 

n— >oo J n— ^-oo J 

The second statement in ()4.9p with p < 2 follows from the strong law of large numbers 
([21^ together with (fTB]) . which gives Si{n-'^/'^X - zl) < Si{n~'^/'^X) + \z\ for ah 1 < i < n. 

The first statement in ()4.9p concentrates most of the difficulty behind theorem 12.21 In 
the next two sub-sections, we will prove and comment the following couple of key lemmas 
taken from |150j and |146j respectively. 

Lemma 4.11 (Count of small singular values). There exist cq > and < 7 < 1 such 
a.s. for n ^ 1 and n^~'^ < i < n — 1 and all M G 7W„(C), 



i(n"i/2^ + M) > Co-. 



n 

Lemma 14.111 is more meaningful when i is close to n^~'^ . For z = n — 1, it gives only a 
lower bound on si. The linearity in i corresponds to what we can expect on spacing. 



Lemma 4.12 (Polynomial lower bound on least singular value). For every a,d > 0, there 
exists b > such that if M is a deterministic complex nxn matrix with si(M) < n'^ then 

¥{sn{X + M) < n-'') < n-". 
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In particular there exists b > which may depend on d such that a.s. for n ^ 1, 

Sn{X + M) > n-\ 

For ease of notation, we write Sj in place of Si{n~^l'^X — zl). Applying lemmas |4. 1 IM. l2] 
with M = —zl and M = —z^/nl respectively, we get, for any c > 0, z G C, a.s. for n 3> 1, 



n ^ * n ^ n ^ 

«=1 «=1 j=?i-[ni^Tj+l 

The first term of the right hand side is a Riemann sum for /q^s~^ ds which converges as soon 
as < p < 1. We finally obtain the first statement in (j4.9p as soon as < p < min(7/6, 1). 
Now the Hermitization lemma ensures that there exists a probability measure E 'P(C) 
such that a.s. [ly ^ as n — )• oo and for all z G C, 



/>oo 

^iiiA = - log{s)du^{s). 
Jo 



Since Vz does not depend on the law of Xn (we say that it is then universal), it follows 
that n also does not depend on the law of Xu, and therefore, by using the circular law 
theorem 13.51 for the Complex Ginibre Ensemble we obtain that is the uniform law on 
the unit disc. Alternatively, following Pan and Zhou [1161 Lemma 3], one can avoid the 
knowledge of the Gaussian case by computing the integral of f^log{s) dvz{s) which should 
match the formula (|4.2|) for the logarithmic potential of the uniform law on the unit disc. 

4.3. Count of small singular values. This sub-section is devoted to lemma [4.111 used 
in the proof of theorem 12.21 to check the uniform integrability assumption in lemma 14.31 



Proof of lemma \4-li ■ We follow the original proof of Tao and Vu [150j. Up to increasing 



7, it is enough to prove the statement for all 2n^~'^ < i < n — 1 for some 7 € (0, 1) to be 
chosen later. To lighten the notations, we denote by si > • • • > the singular values of 
Y := n~^/^X + M. We fix 2n^~"' < i < n — 1 and we consider the matrix Y' formed by 
the first m := n — [i/2] rows of \/nY. Let s'^ > • • • > be the singular values of Y' . By 
the Cauchy-Poincare interlacing^^, we get 

Next, by lemma 14.141 we obtain 

+ • • • + s'~^r- /on = distr^ + • • • + dist^^ ,-. ,„T , 

J- n—\t/2\ i n— 1«/2|' 

where distj := dist{Rj, Hj) is the distance from the j^^ row Rj of Y' to Hj, the subspace 
spanned by the other rows of Y'. In particular, we have 

n-[i/2] n-\i/2'] 

i-;!.<-;-4< E ^T'^ E dist,-. (4.10) 

j=n-\i] j=l 

Now Hj is independent of Rj and dim{Hj) < n — | < n — n^~^, and thus, for the choice 
of 7 given in the forthcoming lemma [4.13[ 

/ n-l n-li/2] ( r ^\ 

EM U U ]'l-t.Si^ll<» 



n>l \i=2?ii-T j=l 



-'^^If A £ Ain{C) and 1 < m < n and if _B £ Aim,n{C) is obtained from A by deleting r :— n — m rows, 
then Si{A) > Si{B) > Si+r(A) for every 1 < i < m. In particular, [sm.(B), si{B)] C [s„(A), si{A)], i.e. the 
smallest singular value increases while the largest singular value is diminished. See j82i Corollary 3.1.3] 
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(note that the exponential bound in lemma 14.131 kills the polynomial factor due to the 
union bound over Consequently, by the first Borel-Cantelli lemma, we obtain that 

a.s. for n ^ 1, all In^""^ ^ i ^ n — 1, and all 1 < j < n — \i/2\ , 

dist, > — — > — — 
^ 2^/2 4 

Finally, KTUh gives > (^^)/(32n2), i.e. the desired result with cq := l/(4\/2). □ 

Lemma 4.13 (Distance of a random vector to a subspace). There exist 7 > and 5 > 
such that for all n ^ 1, 1 < i < n, any deterministic vector v G C" and any subspace H 
of with 1 < dim{H) < n — n^~^ , we have, denoting R := {Xn, . . . + v, 



P^dist(i?,i?) < ^y/n - dim{H)^ < exp( 



The exponential bound above is obviously not optimal, but is more than enough for our 
purposes: in the proof of lemma [4. IH a large enough polynomial bound on the probability 
suffices. 

Proof. The argument is due to Tao and Vu \150\ Proposition 5.1]. We first note that if H' 
is the vector space spanned by H, v and Ei2, then dim{H') < dim(/7) + 2 and 

dist{R,H) > dist{R,H') = dist{R',H'), 

where R' := R — K{R). We may thus directly suppose without loss of generality that v = 
and that E(Xjfc) = 0. Then, it is easy to check that 

E(dist(i?, Hf) = n- dim(F) 

(see computation below). The lemma is thus a statement on the deviation probability of 
dist{R, H). We first perform a truncation. Let < e < 1/3. Markov's inequality gives 

Hence, from HoefFding's deviation inequalitjl^, for n ^ 1, 

^{flM\x.k\<n^} <n-n^-'^ < exp(-2ni-2-(l -n-^)2) < eM-n^'^l- 

It is thus sufficient to prove that the result holds by conditioning on 

E„,:={\Xii\<n'',...,\Xim\<n^} with m := \n - n^~^] . 

Let Em[-] := E[- \Em',J~m] denote the conditional expectation given Em and the filtration 
J^m generated by Xi^m+i, ■ ■ ■ ,Xi^n- Let W be the subspace spanned by 

H, n = (0, . . . ,0,Xi^rn+l,- ■ ■ ,Xi^n), W = (E„[Xii], . . .,Em[Xim],0, ... ,0). 

Then, by construction dim(H^) < dim(ff) + 2 and W is J^n-measurable. We also have 

dist(i?,-H') > dist(i?,T^) = dist(y,Ty), 
where Y = (Xn — A, ... , Xim — A, 0, . . . , 0) = R — u — w and A = Em[^ii]- Next we have 

v2 



\Xii\ < n' 



l-o 1 . 



Now, let us consider the disc D := {z £ C : \z\ < n^} and define the -D™ — )• M+ convex 
function / : x i— )• dist((2;, 0, . . . , 0), W). From the triangle inequality, / is 1-Lipschitz: 

|/(x) - f{x')\ < dist(x,x'). 



"'^^If Xi, . . . ,Xn are independent and bounded real r.v. with di := max(Xi) — min(Xi), and if Sn ~ 
Xi + --- + X„, then ¥{Sn - ES„ < tn) < exp{~2n^t'^ /{df + ■■■ + dl)) for any t > 0. See [TTSl Th. 5.7]. 
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We deduce from Talagrand's concentration inequalit\P^ that 

P^(|dist(y, W) - Mm\ >t)< 4exp ("Y^) . (4-11) 
where Mm is the median of dist(y, W) under E^. In particular, 



Mm > y E„dist2(y, W) - erf. 
Also, if P denotes the orthogonal projection on the orthogonal of I^, we find 

m 

E^dist2(y, W)=Y, Em [n'] Pkk 
k=l 

(n n \ 

Pkk — ^ Pkk I 
k=l k=m+l / 

> a'^{n — dim(VF) — (n — m)) 
>a^{n- dim(i?) - n^"^ - 2) 

We choose some < 7 < e. Then, from the above expression for any 1/2 < c < 1 and 
n > 1, Mm > c^Jn - dim(F). We choose finally t = {c-l/2)^/n - dim{H) in (jiH]) . □ 

The following lemma, taken from \150\ Lemma A4] , is used in the proof of lemma 14.111 

Lemma 4.14 (Rows and trace norm of the inverse). Let 1 < m < n. If A £ 7Wm^„(C) 
has full rank, with rows Ri, . . . , Rm and R-i := span{i?j : j ^ {}, then 



= J^dist(i?i,i?_0" 



i=l i=l 

Proof. The orthogonal projection of R* on the subspace R-i is B*{BB*)^^BR* where B 
is the (m— 1) x n matrix obtained from A by removing the row R^. In particular, we have 

\ Rill - dist2{Ri, R-i f = \B*{BB*)-^BR*\l = {BR*)*{BB*)-^BR* 

by the Pythagoras theorem. On the other hand, the Schur block inversion formula states 
that if M is a m X m matrix then for every partition {1, . . . , m} = I L) P, 

{M-^)ij = {Mij - Mijc{Micj.)-^Micj)-\ (4.12) 

Now we take M = AA* and I = {i}, and we note that {AA*)ij = RiR*, which gives 

{{AA*)-\i = {R^R* - {BR*nBB*)-^BR*)-^ = dist2{Ri, R-i)'^ . 

The desired formula follows by taking the sum over i G {1, . . . , m}. □ 

Remark 4.15 (Local Wegner estimates). Lemma [4. 1 1\ says that z^„-i/2x-z/([0i ??]) ^ v/C 
for every rj > ICn"^ . In this form, we see that lemma \4.ll is an upper bound on the 
counting measure nVn-^/'^x-zi ^ small interval [0,r/]. This type of estimate has already 
been studied. Notably, an alternative proof of lemma \4-ll\ can be obtained following the 
work of |46] on the resolvent of Wigner matrices. 



If Xi, . . . , Xn are i.i.d. r.v. on D := {z € C : \z\ < r} and if / : D" — >■ R is convex, 1-Lipschitz, with 
median M, then P(|/(Xi, . . . ,X„) - M| >t) < 4exp(-Y|^) for any t > 0. See [US] and [Hi Cor. 4.9]. 
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4.4. Smallest singular value. This sub-section is devoted to lemma 13 . 1 2 1 which was used 
in the proof of theorem 12.21 to get the uniform integrabihty in lemma 14.31 

The fuU proof of lemma 13. 121 bv Tao and Vu in [146j is based on Littlewood-Offord type 
problems. The main difficulty is the possible presence of atoms in the law of the entries (in 
this case X is non-invertible with positive probability). Regarding the assumptions, the 
finite second moment hypothesis on Xn is not crucial and can be considerably weakened. 
For the sake of simplicity, we give here a simplified proof when the law of Xu has a 
bounded density on C or on M (which implies that X + M is invertible with probability 
one). In lemma lA.ll in Appendix [XJ we prove a general statement of this type at the 
price of a weaker probabilistic estimate which is still good enough to obtain the uniform 
integrability "in probability" required by lemma [331 

Proof of lemma \4-l^ with bounded density assumption. It suffices to show the first state- 
ment since the last statement follows from the first Borel-Cantelli lemma used with o > 1. 

For every x,y G and S C C"-, we set x ■ y := xiyi -|- • • • -|- Xn]M ^iid ||x||2 := \/x ■ x 
and dist(a;, S) := miuy^s \\x — y\\2- Let Ri, . . . , Rn be the rows oi X + M and set 

R-i := spa.n{Rj;j / i} 

for every 1 < i < n. The lower bound in lemma [3 . 1 6 1 gives 

min dist(i?i, R^i) < ^Sn{X + M) 

l<i<n 

and consequently, by the union bound, for any u > 0, 

F{y/nSn{X + M)<u)<n max P(dist(i?i, i?_i) < u). 

l<i<n 

Let us fix 1 < i < n. Let Yi be a unit vector orthogonal to Such a vector is not unique, 
but we may just pick one which is independent of Ri. This defines a random variable on 
the unit sphere S"""^ = {x G C" : ||x||2 = !}• By the Cauchy-Schwarz inequality, 

\Ri-Yi\ < \\Tr,{Ri)mY,\\2 = dist{Ri,R^i) 

where vTj is the orthogonal projection on the orthogonal complement of R-i. Let Vi be the 
distribution of Yi on S"~^. Since Yi and Ri are independent, for any u> 0, 

P(dist(i?„ i?_,) < n) < ¥{\Ri ■Y,\<u)= [ F{\R, ■ y\ < u) du,{y). 

Let us assume that Xu has a bounded density on C. Since ||y||2 = 1 there exists an 
index Jq £ {1, . . . , n} such that yj^^ ^ with |yjol~^ — V^- The complex random variable 
Ri-y is a sum of independent complex random variables and one of them is Xij^ yj^, which 
is absolutely continuous with a density bounded above by Consequently, by a 

basic property of convolutions of probability measures, the complex random variable Ri ■ y 
is also absolutely continuous with a density ipi bounded above by \/^llvlloo' thus 

fi\Ri-y\<u)= / l{|^|<„}(^i(s)(is < vrn^ 
Jc 

Therefore, for every 6 > 0, we obtain the desired result (the O does not depend on M) 

F{sn{X + M) < n-^-^/^) = 0{n^/^~^''). 

This scheme remains indeed valid in the case where Xu has a bounded density on M. □ 

Lemma 4.16 (Rows and operator norm of the inverse). Let A be a complex nx n matrix 
with rows Ri, . . . , Rn- Define the vector space R-i := span{i?j : j ^ i}. We have then 

n~^/^ min dist{Ri, R-i) < s„(A) < min dist(i?j, i?_j). 

l<i<n l<i<n 
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Proof of lemma \4.16 , The argument, due to Rudelson and Vershynin, is buried in |128) . 
Since A and have same singular values, one can consider the columns Ci, . . . ,Cn of A 
instead of the rows. For every column vector x £ C" and 1 < i < n, the triangle inequality 
and the identity Ax = xiCi + • • • + x„C„ give 

II Ax||2 > dist(j4x, C_i) = min ||Ax — y||2= min \\xiCi — y\\2 = \xi\dist{Ci,C-i). 

If ||x||2 = 1 then necessarily |xj| > n^^/^ for some 1 < i < n and therefore 
Sn(A) = min H^xlU > n^^^^ min distfCj, C_j). 

||2:|l2=l l<i<n 

Conversely, for every 1 < i < n, there exists a vector y with yi = 1 such that 

dist(Ci,C_i) = llyiCi H h ?/nCn||2 = ||^y|l2 > Ily|l2 ™™ ||^2;||2 > Sn(^) 

I|a;|l2=i 

where we used the fact that ||y||2 = |yip + • • • + |ynP > l^iP = 1- D 

Remark 4.17 (Assumptions for the control of the smallest singular value). In the proof 
of lemma with the bounded density assumption, we have not used the assumption on 
the second moment of Xn nor the assumption on the norm of M. 

4.5. Convergence of singular values measure. This sub-section is devoted to corol- 
lary [3?TUJ The proof is divided into five steps. 

Step One: Concentration of singular values measure. First, it turns out that it is sufficient 
to prove the convergence to of Ez/^_i/2j(^_^. Indeed, for matrices with independent rows, 
there is a remarkable concentration of measure phenomenon. More precisely, recall that 
the total variation norm of / : M — )■ M is defined as 



TV 



SUP^ |/(xfc+i) - f{xk)\, 



where the supremum runs over all sequences {xk)k<^z such that x^+i > x^ for any k G 
If / = l(_oo^s] for some s G M then H/H-pv — 1; while if / has a derivative in L^(I 
= J\f'{t)\dt. The following lemma is extracted from [26], see also [74] . 



Lemma 4.18 (Concentration for the singular values empirical measure). If M is a n x n 

complex random matrix with independent rows (or with independent columns) then for any 
/ : M — 7- M going to at ±00 with H/Hxv — ^ '^'^^ every t >0, 



//<i.M-E//..„|>t)<2exp(-2„t^). 



It is worth to mention that if M has independent entries which satisfy a uniform sub- 
Gaussian behavior, then for all Lipschitz function, the concentration of J f dvM has a rate 
n? and not n, see e.g. the work of Guionnet and Zeitouni |71] . 

Proof. If A,B G Mn{€-) and if Fa{-) := va{{-oo,-)) and Fb{-) := vb{{-oo,-)) are the 
cumulative distribution functions of the probability measures va and vb then it is easily 
seen from the Lidskii inequality for singular valueq^ that 

Now for a smooth / : M — )■ M, we get, by integrating by parts, 



/ dvA - f duB 



f'{t){FA{t)-FB{t))dt 



<'-^^^^^^ I \ f'it)\dt. 



U A,B £ Mn{C) with rank(yl - B) < k, then Si-k{A) > Si{B) > Si+k{A) for any 1 < i < n with 
the convention Si = 00 if i < 1 and Si = if i > n. This allows the extremes to blow. See [82i Th. 3.3.16]. 
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Since the left hand side depends on at most 2n points, we get, by approximation, for every 
measurable function / : M — >■ M with ||/||ty < 1, 



fdUB 



rank(A - B) , , 

< ^ ^. (4.13 

n 



From now on, / : M — t- M is a fixed measurable function with ||/||rpY < 1. For every row 
vectors rri, . . . , in C", we denote by A{xi, . . . , a;„) the nxn matrix with rows xi, . . . ,Xn 
and we define F : (C")" ^ M by 



F{xi, . . . ,Xn) := J f d^iA(xi,...,x„)- 

For any i G {1, . . . , n} and any row vectors xi, . . . , Xn, x[ of C", we have 

rank(74(xi, . . . , Xi—i, Xi, . . . , Xn) — -A[xi, . . . , Xi—i, x^, XiJ^i, . . . , < 1 
and thus 

\F[xi, . . . , , Xj, , . . . , Xn) F(^Xi, . . . , Xi—i, Xa , Xj^i , . . . , Xn)\ ^ • 

n 

Finally, the desired result follows from the McDiarmid-Azuma-Hoeffding concentration 
inequality for bounded differenced^ applied to the function F and to the random variables 
Ri,...,Rn (the rows of M). □ 

Step Two: Truncation and centralization. In the second step, we prove that it is sufficient 
to prove the convergence for entries with bounded support. More precisely, we define 

Yij = Xijl{ix,,\<K}, 

where k = k„ is a sequence growing to infinity. Then if y = (l^j)i<ij<n, we have from 
Hoffman- Wielandt inequality (jl.Sp . 



l. j2\sk{n-'/'Y-zI)-s,{n-'/'X-zI)\' <^ J] |X,,fl 
fc=i 

2 



n 



By assumption E|Xjj| goes to as k goes to infinity. Hence, by the law of large 

numbers, the right hand side of the above inequality converges a.s. to 0. On the left hand 
side we recognize the square of the Wasserstein W2 coupling distanc between '-'n-^/'^Y-zi 
and i^n-^/'^x-zi- Since the convergence in W2 distance implies weak convergence, we deduce 
that it is sufficient to prove the convergence of Ez/^_i/2y_^j to Vz- 
Next, we turn to the centralization by setting 

Zij = Yij - EYij = Yij - EXiil||Xjj|<^T,. 

Then if Z = (.Zij)i<i,j<n) we have from the Lidskii inequality for singular values, 

I /r i\i rank(y — Z) 1 

":iax|i^„-l/2y_^^([0,t]) -Z^„-l/2^_^/[0,t])| < < -. 

In particular, it is sufficient to prove the convergence of ^'^n-'^/'^z-zi ^2- 

In summary, in the remainder of this sub-section, we will allow the law of Xn to depend 
on n but we will assume that 

EXii = 0, P(|Xii| > K„) = and E|Xiip = a^, (4.14) 



"'^^If Xi , . . . , Xn are independent r.v. in Xi, . . . , Xn and if / : ^"1 x ■ • ■ x A'n — > R is a measurable 

function then P(|/(Xi, . . . ,X„) -E/(Xi, . . . >t)< 2 exp(-2t^/(c? H Vc\)) for any t > 0, where 

Ch '■= sup^ x'ev^. l/(^) ~ fi^')\ ^iid T>k ~ {{x, x') : Xi = x'^ for all i 7^ k}. We refer to McDiarmid 110'. 

^''xhe W2 distance between two probability measures 1^1,1^2 on K is W2{rii,ri2) :— inf E(|Xi — X2p)^''^ 
where the inf runs over the set of r.v. (Xi,X2) on R x R with Xi ~ rji and X2 ^ rj2. In the case where 
»?i = I ELi with < fli /> and 7,2 = i Er=i with < /- then 1^2(771, ^2? = ^ Er=i(a. " 
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where k = k„ = o{^/n) is a sequence growing to infinity and a = an goes to 1 as n goes 
to infinity. 

Step Three: Linearization. We use a popular linearization technique: we remark the iden- 
tity of the Cauchy-Stieltjes transform, for r] £ C+, 

where z/(-) = + •))/2 is the symmetrized version of a measure ly, and 

n-^/^X - 

Jn-i/2x - z)* 

Through a permutation of the entries, this matrix H{z) is equivalent to the matrix 

B{z) = B - q{z,0)(E) In 

where 

^T] z^ 

and for every 1 < i, j < n, 



H{z) :- 



q{z,r]) : 

1/0 X 



■ ^ yx,i 

Note that B{z) £ 7W„(7W2(C)) ~ A^2n(C) is Hermitian and its resolvent is denoted by 

R{q) = {B{z) - inhnT^ = iB-q{z,rj)^In)-'. 
Then R{q) £ A^„(A^2(C)) and, by (liJ5]l . we deduce that 



We set 



mkk = M'!l 5^Kl€A^2(c). 



It is easy to check that 

«(?):= -y'afc(g) = -V 4(9) and 6(g) := - V fefe(g) = - V Cfc(g), (4.16) 
n ^ — ^ n ^ — ' n — ' n — ' 

fc=i fe=i fc=i fc=i 

(see the forthcoming lemma l4.19p . So finally, 

'mu^-i/2x_J'n) = ail)- (4.17) 

Hence, in order to prove that ^'^n-''-/^x-zi converges, it is sufficient to prove that Ea(g) 
converges to, say, a{q) which, by tightness, will necessarily be the Cauchy-Stieltjes trans- 
form of a symmetric measure. 

Step Four: Approximate fixed point equation. We use a resolvent method to deduce an 
approximate fixed point equation satisfied by a{q). Schur's block inversion (j4.12p gives 



where Q G A^n-i,i(A^2(C)), 

a/tI \Xin 

and, with B = (5ij)i<ij<n-i, B{z) = B - g(z,0) ® 
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is the resolvent of a minor. We denote by Tn-x the smallest cr-algebra spanned by the 
variables {Xij)\<i^j<n-\- We notice that R is J>^_i-measurable and is independent of Q. 
If E„[-] := E[- |J^n-i], we get, using ([m]) and 



l<k/<n-l 

n ^ I 



a 
n 



k=l 

2 



dk 



E 

k=l 



at 
Ofc 



where 



ftfe bk 



Recall that B{z) is a minor of B{z). We may thus use interlacing as in (|4.13|) for the 
function f = {■ — , and we find 



n— 1 n 

fc=i 



fc=i 



< 2 



Hence, we have checked that 



Q*RQ 



\x — r]\ 



a 
a 



:dx = O 



am(?7) 



with ||ei||2 = o(l) (note here that q{z,ri) is fixed). Moreover, we define 



Since 



R 



62 := E„ 



Q*RQ-En Q*RQ [Q*RQ-K, Q*RQ 



R^^Rii 



< Jm{r])-^ and Tr ^ R*jRji = Tr (r*r) < 2n3m{r))-^. 



Also, by 



Then, an elementary computation gives 

Ik2|l2 = o 



0(1). 



n3m(?7)^^ 

Also, we note by lemma H. 181 that a{q) is close to its expectation: 



E|a(g) -Ea(g)|2 = O 



n3m{r]y 



Thus, the matrix 



D 



1 



X„ 



Q*RQ + E 



0(1). 



a 



/n \Xnn / ^"^ ' ""VO a, 
has a norm which converges to in expectation as n — t- oo. Now, we use the identity 

'1 / / ^\ \ -1 



Rnn+{q + E 



a 
a 



RnnD[q + E 



a 
o 
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Hence, since the norms of + E ^ ^ ^ and are at most O^m(ry) ^ , we get 



a 0' " 



with ||e||2 = 0(1). In other words, using exchangeabihty, 

Step Five: Unicity of the fixed point equation. From what precedes, any accumulation 
point of E ^ is solution of the fixed point equation 

" ^-+(0 ')y\ (4.18) 



a) \ \0 a 

with a = a{q) G C_)_. We find 

Oi + rj 

a — 



\z\'^ — (a + r]Y 

Hence, a is a root of a polynomial of degree 3. Hence, to conclude the proof of corollary 
I4.10^ it is sufficient to prove that there is unique symmetric measure whose Cauchy-Stieltjes 
transform is solution of this fixed point equation. For any 77 G C+, it is simple to check that 
this equation has a unique solution in C-(_ which can be explicitly computed. Alternatively, 
we know from ()4.17p and Montel's theorem that r] G C+ 1— )• a{q{z, rf)) S C+ is analytic. In 
particular, it is sufficient to check that there is a unique solution in C+ for r/ = zt, with 
t > 0. To this end, we also notice from ()4.17p that a{q) G iM+ for q = q{z,it). Hence, if 
h{z,t) = 3m{a{q)), we find 

h + t 



h 

Thus, /i 7^ and 

1 



\z\^ + {h + ty 

l + th-^ 



z|2 + (/l + t)2- 

The right hand side in a decreasing function in h on (0, 00) with limits equal to +00 and 
at — )• and /i — )• 00. Thus, there is a unique solution of the above equation. 

We have thus proved that E ^ converges. The proof of corollary 14.101 is over. 

4.6. The quaternionic resolvent: an alternative look at the circular law. 

Motivation. The aim of this sub-section is to develop an efficient machinery to analyze the 
spectral measures of a non-Hermitian matrix which avoids a direct use of the logarithmic 
potential and the singular values. This approach is built upon methods in the physics 
literature, e.g. \47\ \69\ \127\ I126j . As we will see, it is a refinement of the linearization 
procedure used in the proof of corollarv I4.10[ Recall that the Cauchy-Stieltjes transform 
of a measure i/ on M is defined, for rj £ C_|_, as 

nipij]) = I dv{x). 



r] 

The Cauchy-Stieltjes transform characterizes every probability measure on M, and actually, 
following Remark 14. 6t every probability measure on C. However, if the support of the 
measure is not one-dimensional, then one needs the knowledge of the Cauchy-Stieltjes 
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transform inside the support, which is not convenient. For a probabihty measure on C, it 
is tempting to define a quaternionic Cauchy-Stieltjes transform. For q G where 



r] z 
z rj 



we would define 



-1 



This transform characterizes the measure: in !D'(C), 

\\m{dM^{q{z,it))i2 = -nfj., 

where d is as in (14.31) and 

' r] z 



' Z 7] 

If ^ € Ain{C) is normal then -M^^ can be recovered from the trace of a properly defined 
quaternionic resolvent. If A is not normal, the situation is however more delicate and 
needs a more careful treatment. 

Definition of quaternionic resolvent. For further needs, we will define this quaternionic 
resolvent in any Hilbert space. Let H be an Hilbert space with inner product (•,•). We 
define the Hilbert space H2 = H x Z/2Z. For x = (y,e) G H2, we set x = {y,e + 1). In 
particular, this transform is an involution x = x. There is the direct sum decomposition 
H2 = Ho® Hi with H^ = {x = {y, e) : y £ H}. 

Let ^ be a linear operator defined on a dense domain D(A) C H. This operator can 
be extended to an operator on D{A) Z/2Z by setting Ax = {ay,e), for all x = (y, e) € 
D{A) Z/2Z (in other word we extend A hy A I2). We define the operator B in 
d\b) = L>(^) (g)Z/2Z by 

' A*x iix e Hq 
Ax if X G i^i . 



Bx 



For X G i^, if Hx '■ H2 — s- denotes the orthogonal projection on ((x, 0), (x, 1)), for 
x,y £ D{A), we find 

^^y--=^^^^*y={ix,A*y) ^"'o^'O^-^^(C)- 

The operator B will be called the bipartized operator of A, it is an Hermitian operator (i.e. 
for all x,y £ D{B), {Bx,y) = {x,By)). If B is essentially self-adjoint (i.e. it has a unique 
self-adjoint extension), we may define the quaternionic resolvent of A for all q G IHI_|_ as 

RA{q) = {B-Ih® q)-^ 

Indeed, if g = q{z,r]), we note that Ra is the usual resolvent at 77 of the essentially self- 
adjoint operator B(z) = B — In ® q{z,Q). Hence Ra inherits the usual properties of 
resolvent operators (analyticity in rj, bounded norm). We define 

RA{q)xy ■■= UxRAiqWy. 

If H is separable and (ei)i>i is a canonical orthonormal basis of H, we simply write Rij 
instead of i?A(9)e e > ^ Finally, if yl G 7W„(C), we set 



1 " 

rA{q) = -yZRA{qh 



kk- 

n ^ 

k=l 
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If A is normal then it can be checked that R{q)kk £ and Ta{(i) = Mf^^{q). However, 
if A is not normal, this formula fails to hold. However, the next lemma explains how to 
recover anyway fj,A from the resolvent. 

Lemma 4.19 (From quaternionic transform to spectral measures). Let A G A^n(C) and 
q = q{z,r]) G 1HI+. Then, 

Moreover, 
and, in T>'{C), 

fiA = hm db{q). 

Vr q{z,it):tiO 

Proof. For ease of notation, assume that z = and set r(-) = ^Tr(-). If P is the permu- 
tation matrix associated to the permutation (j{2k — 1) = /c, a{2k) = n + k, we get 

{B-lH®q) -P 1^^* ^--^\A*{1^^-AA*r^ r?(r/2 - AM)- V 

Hence, 



r]T{rf^ - AA*)-^ T{A{rf^ - A* A)-^) 
T{A*{ri^ - AA*)~^) r]T{ri^ - A* A)-^ 



Notice that 



1 /■ 1 1 

2 J X — T] X + 7] 

rj 



J.2 _ 



dvA{x) 



rj 

^ dflAA* {x) 



X — T] 

= 7]t{AA* - r]'^)-\ 
Note also that fj.A*A = fJ-AA* implies that 

T(r/2 - AA*)-'^ = T{rf - A* A)'^ . 

Finally, since r is a trace, 

T{A{Tf - A*A)-^) = T{{rf - A*A)-^A) 



= r((r?2 - AM)-M)* 

= r(A*(ry2 - AA*)-i). 

Applying the above to A — z, we deduce the first two statements. 
For the last statement, we write 

j\og\s + it\ dvA-z{s) = log(s2 + t2) di^A-zis) = log((yl - z){A - z)* + f). 

Hence, from Jacobi formula (see remark 14.61 for the definition of d and d) 

d j\og\s + it\ duA-z{s) = \r{{{A - z){A - z)* + f)-^d{{A - z){A - zf + t^)) 



lr{{{A-z){A-zr+t'r\A-z)) 
-l-b{q{z,it)). 
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The function /log \s + it\ dvA-z{s) decreases monotonically to 

\og{s)dvA~z{s) = -Uf,^{z) 

as t 4- 0- Hence, in distribution, 

HA = lini —dd / log \s + it\ duA-zis)- 
tio vr J 

The conclusion follows. □ 

Girko's Hermitization lemma revisited. There is a straightforward extension of Girko's 
lemma 113] that uses the quaternionic resolvent. 

Lemma 4.20 (Girko Hermitization). Let {An)n>i be a sequence of complex random ma- 
trices defined on a common probability space where An takes its values in A^n(C). Assume 
that for all q G EI+ , there exists 

such that for a.a. z £ C, r] £ C+, vuith q = q{z,r]), 

(i') a.s. (respectively in probability) TA„{q) converges to T{q) as n —)• oo 

(ii) a.s. (respectively in probability) log is uniformly integrable for {i'An-zl)n>i 

Then there exists a probability measure /x G 'P(C) such that 

(j) a.s. (respectively in probability) fiA„ as n oo 
(jj') mV'iC), 

jj, = lim db(q). 

vr q{z,it):tlO 

Note that, by lemma l4.19t assumption (i') implies assumption (i) of lemma : the 
limit probability measure I'z is characterized by 

mi>Av) = a{q). 

The potential interest of lemma 14.201 lies in the formula for /x. It avoids any use the 
logarithmic potential. 

Concentration. The quaternionic resolvent enjoys a simple concentration inequality, ex- 
actly as for the empirical singular values measure. 

Lemma 4.21 (Concentration for the quaternionic resolvent). If A is a random matrix in 
7W„(C) with independent rows (or columns) then for any q = q{z,rj) G IHI+ and t >0, 

nWrAiq) - KTAm2 >t)< 2exp 

Proof Let M,N £ A^„(C) with bipartized matrices B,C £ A^2n(C). We have 

. M, 4rank(M - N) , , 

lirM(.)-r«(,)ll.<— (4.19) 

Indeed, from the resolvent identity, for any q £ IHI4., 

D = RM{q) - RN{q) = RM{q){C - B)RN{q). 

It follows that D has rank r < rank(i? — C) = 2rank(M — N). Also, recall that the 
operator norm of D is at most 23m(r/)^^. Hence, in the singular values decomposition 

r 

= ^ SiUiV* 

i=l 



we have Si < 2Jm(r/) . If 11^ : 
then 
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is the orthogonal projection on span{e2A:-i, e2fe}, 

^ n ^ r n 



k=l 



i=l k=l 



Using Cauchy-Schwartz inequahty, 



\rM{q)-rM\\2<lj2 



1 \ \k=i 



\k=l 



i=l 



We obtain precisely ()4.19p . The remainder of the proof is now identical to the proof of 
lemma 14.181 we express r^(g) — ETA{q) has a sum of bounded martingales difference. □ 

Computation for the circular law. As pointed out in [_127] , the circular law is easily found 
from the quaternionic resolvent. Indeed, using lemma [4.211 and the proof of corollarv l4.10[ 
we get, for all q E IHI+, a.s. 

„u.r„_,,,(,)^r(,)^(|(^) 

where, from (j4.18p . 

T = -{q + dmg{T))-'B 
In the proof of corollary 14.101 we have checked that for t] = it, a{q) = ih{z,t) S iR+ where 

1 + th-^ 



We deduce easily that 



Then, from 



we find 



1 



hmh(z, t) 
40 



\z\^ + {h + ty 



a/1 - |z|2 if \z\ < 1 
otherwise. 



-z if|z|<l 
otherwise. 

As lemma 14.201 dictates, if we compose by —TT~^d we retrieve the circular law. 



lim f3{q) 

(j(z,it):40 



5. Related results and models 

5.1. Replacement principle and universality for shifted matrices. It is worthwhile 
to state the following lemma, which can be seen as a variant of the Hermitization lemma 
14.31 The next statement is slightly stronger than its original version in [150, Theorem 2.1]. 

Lemma 5.1 (Replacement principle). Let {An)n>i and (i?n)n>i be two sequences where 
An and Bn are random variables in A^„(C). If for a. a. z € C, a.s. 

(k) lim„^oo Uf,^^ {z) - U^g^ (z) = 
(kk) log(l + •) is uniformly integrable for {i'A„)n>i and {i^B„)n>i 
then a.s. ha„ — fJ-Bn as n — >• oo. 

^^This equation is the analog of the fixed point equation satisfied by the Cauchy-Stieltjes transform m 
of the semi circular law: m{ri) = —[t] + m{ri))~^. 
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A proof of the lemma follows by using the argument in the proof of lemma 14.31 Using 
their replacement principle, Tao and Vu have proved in [150J that the universality of the 
limit spectral measures of random matrices goes far beyond the circular law. We state it 
here in a slightly stronger form than the original version, see [22j . 

Theorem 5.2 (Universality principle for shifted matrices). Let X and G he the random 
matrices considered in sections \^ and^ obtained from infinite tables with i.i.d. entries. 
Consider a deterministic sequence (M„)„>i such that Mn G A^n(C) and for some p> 0, 



Then a.s. f^n-^/2x+M„ " Mn-i/2G+M„ as n ^ oo. 

5.2. Related models. We give a list of models related to the circular law theorem 12.21 

Sparsity. The circular law theorem 12.21 may remain valid if one allows the entries law to 
depend on n. This extension contains for instance sparse models in which the law has an 
atom at with mass p„ — )• 1 at a certain speed, see [Ml 11461 1156] , 

Outliers. The circular law theorem 12.21 allows the blow up of an arbitrary (asymptotically 
negligible) fraction of the extremal eigenvalues. Indeed, it was shown by Silverstein [137j 
that if E(|Xii|'') < oo and E{Xii) / then the spectral radius |Ai(n"^/^X)| tends to 
infinity at speed y/n and has a Gaussian fluctuation. This observation of Silverstein is the 
base of [31] , see also the ideas of Andrew [8] . More recently, Tao studied in [143] the outliers 
produced by various types of perturbations including general additive perturbations. 

Sum and products. The scheme of proof of theorem 12.21 (based on Hermitization, loga- 
rithmic potential, and uniform integrability) turns out to be quite robust. It allows for 
instance to study the limit of the empirical distribution of the eigenvalues of sums and 
products of random matrices, see [25], and also \U7\ in relation with Fuss-Catalan laws. 
We may also mention [115]. The crucial step lies in the control of the small singular values. 

Cauchy and the sphere. It is well known that the ratio of two independent standard real 
Gaussian variables is a Cauchy random variable, which has heavy tails. The complex 
analogue of this phenomenon leads to a complex Cauchy random variable, which is also the 
image law by the stereographical projection of the uniform law on the sphere. The matrix 
analogue consists in starting from two independent copies Gi and G2 of the Complex 
Ginibre Ensemble, and to consider the random matrix Y = G^^G2- The limit of fiy was 
analyzed by Forrester and Krishnapur |50] . Note that Y does not have i.i.d. entries. 

Random circulant matrices. The eigenvalues of a non-Hermitian circulant matrix are linear 
functionals of the matrix entries. Meckes [112] used this fact together with the central 
limit theorem in order to show that if the entries are i.i.d. with finite positive variance 
then the scaled empirical spectral distribution of the eigenvalues tends to a Gaussian law. 
We can imagine a heavy tailed version of this phenomenon with a-stable limiting laws. 

Single ring theorem. Let D G A^„(]R+) be a random diagonal matrix and U,V & A^„(C) 
be two independent Haar unitary matrices, independent of D. The law of X := UDV* 
is unitary invariant by construction, and vx = f^D (it is a random SVD). Assume that 
(Id tends to some limiting law as n — t- 00. It was conjectured by Feinberg and Zee 
|48j that (ix tends to a limiting law which is supported in a centered ring of the complex 
plane, i.e. a set of the form {z G C : r < |z| < R}. Under some additional assumptions, 
this was proved by Guionnet, Krishnapur, and Zeitouni |7Uj by using the Hermitization 
technique and specific aspects such as the Schwinger-Dyson non-commutative integration 
by parts. Guionnet and Zeitouni have also obtained the convergence of the support in a 
more recent work [73J. The Complex Ginibre Ensemble is a special case of this unitary 
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invariant model. Very recently, Khoruzhenko discovered a new and relatively simple model 
(quadratized rectangular Ginibre matrix) which gives rise to a single ring. 

Large deviations and logarithmic potential with external field. The circular law theorem 
13.51 for the Complex Ginibre Ensemble can be seen as a special case of the circular law 
theorem for unitary invariant random matrices with eigenvalues density proportional to 




(Ai,...,A„)^exp( -—yv{\) ) [[\\i-XjV 

i<j 

where y : C i— M is a smooth potential growing enough at infinity. Since 

1=1 / i<j \ 1=1 i<j 

we discover an empirical version of the logarithmic energy functional £{■) defined in 
penalized by the "external" potential V. Indeed, it has been shown by Hiai and Petz |119) 
(see also Ben Arous and Zeitouni [T7] ) that the Complex Ginibre Ensemble satisfies a large 
deviations principle at speed for the weak topology on the set of symmetric probability 
measures (with respect to conjugacy), with good rate function given by 

(Sif^) + j^'^l^-l = \jj + - 2 log I A - z\) df,{z)dfi{X) - I (5.1) 

This rate function achieves its minimum at point fi = Ci. This is coherent with the fact 
that the circular law Ci is the minimum of the logarithmic energy among the probability 
measures on C with fixed variance, see the book of Saff and Totik [130j . Note that this 
large deviations principle gives an alternative proof of the circular law for the Ginibre 
Ensemble thanks to the first Borel-Cantelli lemma. 

Dependent entries. According to Girko, in relation to his "canonical equation K20", the 
circular law theorem 12.21 remains valid for random matrices with independent rows pro- 
vided some natural hypotheses |57j . A circular law theorem is available for random Markov 
matrices including the Dirichlet Markov Ensemble [23], and random matrices with i.i.d. 
log-concave isotropic row@ [T]. Another Markovian model consists in a non-Hermitian 
random Markov generator with i.i.d. off-diagonal entries, which gives rise to a new limiting 
spectral distribution, possibly not rotationally invariant, which can be interpreted using 
free probability theory, see |24) . Yet another model related to projections in which each 
row has a zero sum is studied in |143j . To end up this tour, let us mention another kind 
of dependence which comes from truncation of random matrices with depend entries such 
as Haar unitary matrices. Namely, let U be distributed according to the uniform law on 
the unitary group Vn (we say that U is Haar unitary). Dong, Jiang, and Li have shown in 
|38) that the empirical spectral distribution of the diagonal sub-matrix {Uij)i<ij<m tends 
to the circular law if m/n — t- 0, while it tends to the arc law (uniform law on the unit 
circle G C : 1^1 = 1}) if m/n — t- 1. Other results of the same fiavor can be found in |89) . 

Tridiagonal matrices. The limiting spectral distributions of random tridiagonal Hermitian 
matrices with i.i.d. entries are not universal and depend on the law of the entries, see |120j 
for an approach based on the method of moments. The non-Hermitian version of this 
model was studied by Goldsheid and Khoruzhenko [64] by using the logarithmic potential. 
Indeed, the tridiagonal structure produces a three terms recursion on characteristic poly- 
nomials which can be written as a product of random 2x2 matrices, leading to the usage 
of a multiplicative ergodic theorem to show the convergence of the logarithmic potential 
(which appears as a Lyapunov exponent). In particular, neither the Hermitization nor 



22 



An absolutely continuous probability measure on R" is log-concave if its density is e ^ with V convex. 
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the control the smallest and small singular values are needed here. Indeed the approach 
relies directly on remark SSI Despite this apparent simplicity, the structure of the limiting 
distributions may be incredibly complicated and mathematically mysterious, as shown on 
the Bernoulli case by the physicists Holz, Orland, and Zee 



5.3. Free probability interpretation. As we shall see, the circular law and its exten- 
sions have an interpretation in free probability theory, a sub-domain of operator algebra 
theory. Before going further, we should recall briefly certain classical notions of operator 
algebra. We refer to Voiculescu, Dykema and Nica |152j for a complete treatment of free 
non-commutative variables, see also the book by Anderson, Guionnet, and Zeitouni for the 
link with random matrices [7]. In the sequel, H is an Hilbert space and we consider a pair 
(A^, r) where Ai is an algebra of bounded operators on H, stable by the adjoint operation 
*, and where r : 7W — )• C is a linear map such that r(l) = 1, r(aa*) = T(a*a) > 0. 

Definition of Brown measure. For a G A^, define \a\ = y/ aa* . For h self-adjoint element 
in A^, we denote by the spectral measure of h: it is the unique probability measure on 
the real line satisfying, for any integer /c E N, 



Also, if a € A^, we define 

= /^|a|- 

Then, in the spirit of ()4.7p . the Brown measure |30j of a G Af is the unique probability 
measure Ha on C, which satisfies for almost all z G C, 



log \z - A| d/ia(A) = J log(s) dua-z{s). 

In distribution, it is given by the formulcP^ 

fia = j log(s) dva-z{s)- (5.2) 

The fact that the above definition is indeed a probability measure requires a proof, which 
can be found in |76j . Our notation is consistent: first, if a is self-adjoint, then the Brown 
(spectral) measure coincides with the spectral measure. Secondly, if A^ = A1„(C) and 
r := ^Tr is the normalized trace on A1„(C), then we retrieve our usual definition for ua 
and ^A- It is interesting to point out that the identity (j5.2p which is a consequence of 
the definition of the eigenvalues when M = A^n(C) serves as a definition in the general 
setting of von Neumann algebras. 

Beyond bounded operators, and as explained in Brown [30] and in Haagerup and Schultz 
[76] . it is possible to define, for a class A^ D A^ of closed densely defined operators affiliated 
with A^, a probability measure on C called the Brown spectral measure of a G A^. 

Failure of the method of moments. For non-Hermitian matrices, the spectrum does not 
necessarily belong to the real line, and in general, the limiting spectral distribution is 
not supported in the real line. The problem here is that the moments are not enough 
to characterize laws on C. For instance, if Z is a complex random variable following the 
uniform law on the centered disc {z G C; |2;| < k} of radius k then for every r > 0, 
E(Z'') = and thus is not characterized by its moments. Any rotational invariant law 
on C with light tails shares with the same sequence of null moments. One can try to 
circumvent the problem by using "mixed moments" which uniquely determine /i by the 



^"^The quantity exp j \og(t) d^\a\{i) is known as the Fuglede-Kadison determinant of a £ M, see [51] . 
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Weierstrass theorem. Namely, for every A G A^„(C), if yl = UTU* is the Schur unitary 
triangularization of A then for every integers r, r' > and with z = x + iy and r = ^Tr, 

/ z^'r' dfiA{z) = - y X'iiA)X~(Af = t{TT') / TiT^T*"') = riA^A*''). 

Indeed equahty holds true when T = T* , i.e. when T is diagonal, i.e. when A is normal. 
This explains why the method of moments looses its strength for non-normal operators. 
To circumvent the problem, one may think about using the notion of ^-moments. Note 
that if A is normal then for every word A^'^ ■ ■ ■ A^^ where ei, . . . , e„ G {1, *}, we have 
r(A^i . . . A^fc) = t{A^^A*^^) where ki,k2 are the number of occurrence of A and A* . 

•k- distribution. The ★-distribution of a G is the collection of all its moments: 

where n > 1 and ei, . . . , e„ G {li *}• The element c G is circular when it has the 
distribution of (si + is2)/V^ where si and S2 are free semi circular variables with spectral 
measure of Lebesgue density x i— >• ^\/4 — x'^l[_2^2] i^)- 

The ^-distribution of a G 7W allows to recover the moments of \a — = {a — z){a — z)* 
for all 2 G C, and thus z^a-z for all z G C, and thus the Brown measure fia of a. Actually, 
for a random matrix, the ^-distribution contains, in addition to the spectral measure, an 
information on the eigenvectors of the matrix. 

We say that a sequence of matrices {An)n>i where A takes it values in 7W„(C) converges 
in moments to a G A4, if all ^-moments converge to the ^-moments of a G Ai. For 
example, if G G A^n(C) is our complex Ginibre matrix, then a.s. as n — t- oo, n^^/'^G 
converges in moments to a circular element. 

Discontinuity of the Brown measure. Due to the unboundedness of the logarithm, the 
Brown measure /Xq depends discontinuously on the ^-moments of a I138j . The limiting 
measures are perturbations by "balayage". A simple counter example is given by the 
matrices of example II. 2i For random matrices, this discontinuity is circumvented in the 
Girko Hermitization by requiring a uniform integrability, which turns out to be a.s. satisfied 
the random matrices n~^/'^X in the circular law theorem 12.21 

However, Sniady [138t Theorem 4.1] has shown that it is always possible to regularize the 
Brown measure by adding an additive noise. More precisely, if G is as above and {An)n>i 
is a sequence of matrices where A„ takes its values in A^„(C), and if the ^-moments of A^ 
converge to the ★-moments of a G as n — )• oo, then a.s. n — )• oo M^^+jn-i/zc converges 
to fia+tc, c is circular element free of a. In particular, by choosing a sequence tn going 
to sufficiently slowly, it is possible to regularize the Brown measure: a.s. tJ'A„+t„n-^/'^G 
converges to fia- Note that the universality theorem 15.21 shows that the same result holds 
if we replace G by our matrix X. We refer to Ryan |129) and references therein for the 
analysis of the convergence in moments. See also the forthcoming book of Tao |144] . 
The Sniady theorem was revisited recently by Guionnet, Wood, and Zeitouni [72^ . 

6. Heavy tailed entries and new limiting spectral distributions 

This section is devoted to the study of the analogues of the quarter circular and circu- 
lar law theorems I2.H|X^ when Xn has an infinite variance (and thus heavy tails). The 
approach taken from involves many ingredients including the Hermitization of section 
m To lighten the notations, we often abridge A — zl into A — z for an operator or matrix 
A and a complex number z. 
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Figure 5. The upper plot shows the spectrum of a single n x n matrix 

n~^/"X with n = 4000 and i.i.d. heavy tailed entries with Xu = eC/~^/° 
with a = 1 and U uniform on [0, 1] and e uniform on { — 1, 1} independent 
of U. The lower plot shows the histogram of the singular values (blue) 
and the histogram of the module of the eigenvalues (red) of this random 
matrix. The singular values vector is trimmed to avoid extreme values. 



6.1. Heavy tailed analogs of quarter circular and circular laws. We now come 
back to an array X := {Xij)Kij<n of i.i.d. random variables on C. We lift the hypothesis 
that the entries have a finite second moment: we will assume that, 

• for some < a < 2, 

lim t"P(|Xii| > t) = 1, (6.1) 

t—>-co 

• as t — 7- oo, the conditional probability 

converges to a probability measure on the unit circle := {z £ C : \z\ = 1} . 
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The law of the entries belongs then to the domain of attraction of an a-stable law. An 
example is obtained when \Xii\ and are independent with |^ii| = \S\ where S 

is real symmetric a-stable. Another example is given by Xu = eW-^/'^ with e and W 
independent such that e is supported in while W is uniform on [0, 1]. 

The interest on this type of random matrices has started with the work of the physicists 
Bouchaud and Cizeau [28J . One might think that the analog of the Ginibre ensemble is a 
matrix with i.i.d. a-stable entries. It turns out that this random matrix ensemble is not 
unitary invariant and there is no explicit expression for the distribution of its eigenvalues. 
This lack of comparison with a canonical ensemble makes the analysis of the limit spectral 
measures more delicate. We may first wonder what is the analog of the quarter circular 
law theorem 1 2.11 This question has been settled by Belinschi, Dembo and Guionnet [15] 
(built upon the earlier work of Ben Arous and Guionnet [16] ) . 

Theorem 6.1 (Singular values of heavy tailed random matrices). There exists a probability 
measure Ua on M_(. such that a.s. i^n-^/"X as n ^ oo. 

This probability measure depends only on a. It does not have a known explicit closed 
form but has been studied in [TBI ESI IS]. We know that Va has a bounded continuous 
density fa on M+, which is analytic on some neighborhood of oo. The explicit value of 
fa{x) is only known for x = 0. But, more importantly, we have 

lim r+Va(i) = a. 
In particular, inherits the tail behavior of the entries: 

lim ri^a([t,oo)) = 1. 

The measure is a perturbation of the quarter circular law: it can be proved that 
converges weakly to the quarter circular law as a converges to 2. Contrary to the finite 
variance case, the n~^/° normalization cannot be understood from the computation of 

f 1 " 

since the later diverges. A proof of the tightness of t'^-i/oj^ requires some extra care that 
we will explain later on. However, at a heuristic level, we may remark that if . . . , Rn 
denotes the rows of n~'^^°'X then for each k, 

1 " 

2 = 1 

converges weakly to a non-negative ^-stable random variable. Hence the normal- 
ization stabilizes the norm of each row of X. 

Following |26j, we may also investigate the behavior of the eigenvalues of X. Here is 
the analogue of the circular law theorem 12.21 for our heavy tailed entries matrix model. 

Theorem 6.2 (Eigenvalues of heavy tailed random matrices). There exists a probability 
measure fj-a on C such that in probability fJ-n-i/^x as n ^ oo. Moreover, if Xu has 

a bounded density, then the convergence is almost sure. 

We believe that theorem 16.21 can be upgraded to an a.s. weak convergence, but our 
method does not catch this due to slow "in probability" controls on small singular values. 

Again, the measure /i^ depends only on a and is not known explicitly. However, it is 
isotropic and has a bounded continuous density with respect to Lebesgue measure dxdy 
on C: d^aiz) = ga{\z\)dxdy. The value of ga{r) is explicit for r = 0. As r — )• oo, the tail 
behavior of ga is up to multiplicative constant equivalent to 

^2(a-l)g-fr"_ 
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This exponential decay is quite surprising and contrasts with the power tail behavior of 
fa- It indicates that X is typically far from being a normal matrix. Also, we see that the 
eigenvalues limit spectrum is more concentrated than the singular values limit spectrum. 
In fact, in the finite variance case, the phenomenon is already present: the quarter circular 
law has support [0, 2] while the circular law has support the unit disc. Again, the measure 
is is perturbation of the circular law: /i^ converges weakly to the circular law as a 
converges to 2. 

The proof of theorem 16.21 will follow the general strategy of Girko's Hermitization. 
Lemma 14.31 gives a characterization of the limit measure in terms of its logarithmic po- 
tential. Here, it turns out to be not so convenient in order to analyze the measure /Xq. 
We will rather use the quaternionic version of Girko's Hermitization, i.e. lemma [4.20i For 
statement {i') in lemma [4.20| we will prove a generalized version of theorem 16.11 

Theorem 6.3 (Singular values of heavy tailed random matrices). For all z £ C there 
exists a probability measure v^^z on such that a.s. z^„-i/ax-z ^".^ ols n ^ oo. 
Moreover, with the notations used in lemma for all q = q{z,rf) G there exists 

r(g) G EI+, such that a.s. T^^^i/axiQ) converges to T{q) and T{q)ii = rnp^ .^{ri). 

Objective method - sparse random graphs and trees. The strategy for proving theorem 16. 3| 
borrowed from [26| , will differ significantly from the proof of theorem 12. 1[ We will prove 
that n~^/"X converges in some sense, as n — >• oo, to a limit random operator A defined 
in the Hilbert space ^^(N). This will be done by using the "objective method" initially 
developed by Aldous and Steele in the context of randomized combinatorial optimization, 
see [6]. We build an explicit operator on Aldous' Poisson Weighted Infinite Tree (PWIT) 
and prove that it is the local limit of the matrices n~^^"X in an appropriate sense. While 
Poisson statistics arises naturally as in all heavy tailed phenomena, the fact that a tree 
structure appears in the limit is roughly explained by the observation that non-vanishing 
entries of the rescaled matrix n~^/"X can be viewed as the adjacency matrix of a sparse 
random graph which locally looks like a tree. In particular, the convergence to PWIT 
is a weighted-graph version of familiar results on the local tree structure of Erdos-Renyi 
random graphs. 

Free probability. We note finally that it is possible to associate to the PWIT a natural 
operator algebra with a tracial state r. Then for some operator a affiliated to M, the 
probability measure fia is equal to the Brown measure fia of a, and Va = /^|a| = is the 
singular value measure of a. See the work of Aldous and Lyons [SJ 11061 Example 9.7 and 
Sub-Section 5], and the recent work of Male |1U7] . 

6.2. Tightness and uniform integrability. 

Large singular values. We first prove the a.s. tightness of the sequences (A*n-i/ax)n>i 
(f„-i/ax-z)n>i with z £ C It is sufficient to prove that for some p > 0, for all z £ C, 




(6.2) 



Prom ()1.6p . for any A £ 7W„(C), with have Si{A — z) < Si{A) + \z\ and thus 




Moreover, from 11.31 we get, for any p > 0, 
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In summary, it it is sufficient to prove that for some p > 0, a.s. 

lim / du i/ax{s) < oo. (6.3) 
and (|6.2p will follow. We shall use the following Schatten bound: for all < p < 2, 

r 1 " 

sPduA{s)<-Y,\\Rkr2- 



k=l 



for every A G A^„(C), where . . . are the rows of A (for a proof, see Zhan |1581 
proof of Theorem 3.32]). The above inequality is an equality if p = 2 (for p > 2, the 
inequality is reversed). For our matrix, A = n~'^/°'X, we find 



r 1/1 

k=l \ 1=1 / 

The strategy of proof of ()6.3p is now clear: the right hand side is a sum of i.i.d. variables, 
and from (|6.ip . 1^^^ = n~'^/°^ X^iLi l^^^fciP the domain attraction of a non- negative a/2- 
stable law. We may thus expect, and it is possible to prove, that for q small enough, 

m EY^l < oo. 

Then, the classical proof of the strong law of large numbers for independent random 
variables bounded in implies ()6.3p . 

Uniform integrability. We will prove statement (ii) of lemma I4.2UI in probability. Fix 
z G C Using (|6.2p . we shall prove the uniform integrability in probability of min(0,log) 
for {i^n-i/ax-z)n>i' -f'^o™ Markov's inequality, it is sufficient to prove that for some c > 0, 



lim lim P / s-^du-,/^^ Js) > t = 0. (6.4) 

Arguing as in the finite variance case, the latter will in turn follow from two lemmas: 

Lemma 6.4 (Lower bound on least singular value). For all d > 0, there exist b,c > 
such that if M £ A^„(C) is deterministic with ||M||2 < n'^, then for n ^ 1, 



-b^ ^ ^ /log^ 



F(sn(X + M) < n-") < c 

V n 

The next lemma asserts that the i-th smallest singular of the random matrix n^^^"X+M 
is at least of order (i/n)^"''^"^^^ in a weak sense. This is not optimal but enough. 

Lemma 6.5 (Count of small singular values). There exist < 7 < 1 and cq > such 
that for all M £ A^„(C), there exists an event such that lim^^oo ^{Fn) = 1 and for all 
n^~^ < i < n — 1 and n ^ 1, 



E 



.;!,(n-i/"X + M)|F„] <co(^) 



Let us first check that these two lemmas imply ()6.4p (and thus statement (ii) of lemma 
I4.20p . Let us define the event ■= Fn n {s„(n~^/°X — z) > n~^}. Let us define also 

E„[-] ■.= E[-\En]. 

Since the event En has probability tending 1, the proof of ()6.4p would follow from 



hm E^ 

n— >oo 



X ^ dv^-,/^x-z{s) 



< 00. 
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For simplicity, we write Si instead Sj(n ^^"'X — zl). S ince ^ n ^ lias probability tending 
to 1, by lemma [631 for all n^~'^ < * < ?^ — 1, 



< 



Then, for < p < 2, using Jensen inequality, we find 



,(») 



1 



j=0 



< n" 



-7+p6 



+ 



1 

n 

1 

n 



n — 1 



In this last expression we discover a Riemann sum. It is uniformly bounded if p < 7/6 
and p < 2a/{a + 2). The uniform bound ()6.4p follows. 



Proof of lemma The probability that si(X) > n^^^ is upper bounded by the prob- 
ability that one of the entries of X is larger that n^. Prom Markov's inequality and the 
union bound, for p large enough, this event has probability at most 1/n. In particular, 
si{X + M) < si{X) + si(M) is at most 2n'^ for q = max{p,d) with probability at least 
1 — 1/n. The statement is then a corollary of lemma PV. 11 Note: a simplified proof in the 
bounded density case may be obtained by adapting the proof of lemma [^.121 (see pBj). □ 



Sketch of proof of lemma 16'. 5[ We now comment the proof of lemma 16.51 the detailed ar- 
gument is quite technical and is omitted here. It can be found in extenso in [26]. First, 
as in the finite variance case, the proof reduces to derive a good lower bound on 

dist\Xi,W) = {Xi,PXi), 

where Xi is the first row of X, is a vector space of Co-dimension n — d > n^~'' (in 
or C") and P is the orthogonal projection on the orthogonal of W. However, in the 
finite variance case, dist^(Xi, M/^) concentrates sharply around its average: n — d. Here, 
the situation is quite different, for instance if W = vect(e„_rf+i, . . . ,e„), we have 

n—d 



{n - dy^dist^{Xi,W) = (n - d)'^ ^ \Xii\ 



and thus (n — d)" «dist^(Xi, W) is close in distribution to a non- negative a/2-stable ran- 
dom variable, say 5. 

On the other hand, if [/ is a n x n unitary matrix uniformly distributed on the unitary 
group (normalized Haar measure), and if W is the span of the last d row vectors, then it 
can be argued than dist^(Xi, W) is close in distribution to c{n — d)n'^~^S. Hence, contrary 
to the finite variance case, the order of magnitude of the distance of Xi to the vector space 
W depends on the geometry of W with respect to the coordinate basis. We have proved 
some lower bound on this distance which are universal on W . More precisely, for any 
< 7 < a/4, there exists ci > 0, such that for some event Gn with P(G5^) < cin~("^~^'^)/", 

E[dist-2(Xi,iy) I Gn] <ci(n-d)-S. 

The above holds for n — d > n^~'^ . We have crucially used the fact that for all p > 0, 
'KS~'P is finite, i.e. the non- negative a/2-stable law is fiat in the neighborhood of 0. 



AROUND THE CIRCULAR LAW 



43 



Note: the result implies that the vector space W = vect(e„_d+i, • • • , en) reaches the 
worst possible order of magnitude. Unfortunately, the upper bound on the probability of 
the event is not good enough, and we also have to define the proper event F„ given in 
lemma This event satisfies P(i^^) < cexp(— n'^) for some 5 > and c > 0. □ 

6.3. The objective method and the Poisson Weighted Infinite Tree (PWIT). 

Local convergence. We now describe our strategy to obtain the convergence of ET^-i/ax- 
It is an instance of the objective method : we prove that our sequence of random matrices 
converges locally to a limit random operator. To do this, we first notice that a n x n 
complex matrix M can be identified with a bounded operator in 

= {{xk)keN G : 5^ Ixfcp < oo} 
k 



Yl]=i ^ji^j if 1 < i < n 
otherwise. 

With an abuse of notation, without further notice, we will identify our matrices with their 
associated bounded operator in -£^(N). The precise notion of convergence that we will use 
is the following. 

Definition 6.6 (Local convergence). LetT>(N) he the set of compactly supported elements 
of i^(N). Suppose (An) is a sequence of bounded operators on ^^(N) and A is a linear 
operator on £^(N) with domain D{A) = 2?(N). For any u,v G N we say that {An,u) 
converges locally to {A,v), and write 

{An,u) {A,v) 

if there exists a sequence of bijections (T„ : N — )• N such that 

• an{v) = u 

• for all (f) G P(N), lim„_^oo o-,^^^nC^n</' = A(j) in 

With a slight abuse of notation we have used the same symbol (T„ for the linear isometry 
an ■ £^(N) — >• £^(N) induced in the obvious way. Note that the local convergence is 
the standard strong convergence of the operator a~^AnCrn to A. This re-indexing of N 
preserves a distinguished element. It is a local convergence in the following way, if P{x, y) 
is a non- commutative polynomial in C, then the definition implies 

{^ui P{An, A^)eu) —7" {cv, P{A, A )ey). 

We shall apply this definition to random operators An and A on £^(N): to be precise, in 
this case we say that {An,u) — )• {A, v) in distribution if there exists a random bijection (T„ 
as in definition 16.61 such that a'^AnancI) converges in distribution to A(p, for all (/) G I'(N), 
where a random vector tpn G ^^(N) converges in distribution to Tp if 

hm EfiiJn) = E/(^) 

for all bounded continuous functions / : £^(N) — )• M. Finally, we may without harm replace 
N by an infinite countable set V. All definitions carry over by considering any bijection 
from N to V: namely ("^{V), for v £ V, the unit vector e^,, ^{V) and so on. 



by setting 

Me,; 
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The Poisson Weighted Infinite Tree (PWIT). We now define our limit operator on an 
infinite rooted tree witli random edge- weights, the Poisson weighted infinite tree (PWIT) 
introduced by Aldous [Ij , see also |6] . 

The PWIT is the random weighted rooted tree defined as follows. The vertex set of the 
tree is identified with N-^ := Ufc>iN'^ by indexing the root as N*^ = 0, the offsprings of the 
root as N and, more generally, the offsprings of some v G N^' as (f 1), ("^2), . . . G N*^^^ (for 
short notation, we write (fl) in place of (w,l)). In this way the set of w G N" identifies 
the n**^ generation. We then define T as the tree on N-^ with (non-oriented) edges between 
the offsprings and their parents (see figure [6|). 




Figure 6. Representation of the PWIT. 

We denote by Be(l/2) the Bernoulli probability distribution ^6q + ^5i. Also, recall 
that by assumption lim^^^oo IP'(-'^ii/|-'^ii| £ • I l-'^iil > = ^(O; ^ probability measure on 
the unit circle S^. Now, assign marks to the edges of the tree T according to a collection 
{S^l^gP^/ of independent realizations of the Poisson point process with intensity measure 
{2tj ® 6 ® Be(l/2) on M+ x x {0, 1}, where I denotes the Lebesgue measure on M+. 
Namely, starting from the root 0, let H0 = {(yi, tJi, ei), (1/2, W2, £2), • • • } be ordered in 
such a way that < yi < 7/2 < • • • > and assign the mark {yi,uJi,£i) to the offspring of 
the root labeled i. Now, recursively, at each vertex v of generation k, assign the mark 
(y«,w„i,e«) to the offspring labeled vi, where S„ = {iyvi,^vi,£vi),iyv2,^v2,i:v2), ■ ■ ■} 
satisfy < yyi < yy2 < • • • • The Bernoulli mark Eyi should be understood as an orientation 
of the edge {v, vi}: if £yi = 1, the edge is oriented from vi to v and from v to vi otherwise. 

We may define a random operator A on V(N-^) by the formula, for all v G N-^\{0} 

Acy = ^(1 - eyk)i^yky~l^°'eyk + £v'^a{v)yv^^'^ea(v) (6.5) 

k>l 

where a{v) denotes the ancestor of v, while 

-4e0 = ^(1 - £k)ujyky^^^°'ek. 

k>l 

This defines a proper operator on 'D{W). Indeed, since {yvi,yv2, . . . } is an homogeneous 
Poisson point process of intensity 2 on M+: we have a.s. limk^oo Vvk/k = 2. We thus find 

for V G N-^\{0} 

\\Aev\\l = ^(.'^ - ^vk)yyk'°' + £a(v)yy'^'°' < 00, 
k>l 

and similarly with ||j4e0||2. 



AROUND THE CIRCULAR LAW 



45 



Theorem 6.7 (Local convergence to PWIT). In distribution (n ^/"X, 1) — {A,0). 

Sketch of proof. We start with some intuition behind theorem 16. 71 The presence of Poisson 
point processes is an instance of the Poisson behavior of extreme ordered statistics. If 
Vii > Vi2 > ■ ■ ■ > Vin is the ordered statistics of vector . . . , |Xi„|) then, it is 

weU-known that the random variable in the space of non-increasing infinite sequences 

n-^/"(yii,yi2,...,14n,0,...) 

converges weakly, for the finite dimensional convergence, to 

{x-'/-,x-'/-,...) (6.6) 

where xi < X2 < ■ ■ ■ are the points of an homogeneous Poisson point process of intensity 
1 on M_|_. As observed by LePage, Woodroofe and Zinn [lOlj . this fact follows easily from 
a beautiful representation for the order statistics of i.i.d. random variables. Namely, if 
G{u) = P(|Xii| > u) is (one minus) the distribution function of l-'^iil, then 

(Vii,...,Vi„) = (G~^(xi/x„+i), . . . , G~^(x„/x„+i)), 

where G~^{u) = inf{y > : G{y) < u}, u E (0, 1). To obtain the convergence to ()6.6p . it 
remains to notice that G~^{u) ~ n~^/" as u — )• 0, and x^ ^ n a.s. as n — )• oo. 
More generally, we may reorder non-increasingly the vector 

and find a permutation vr G 5„ such that 

||(-'^l7r(l))-'^7r(l)l)||2 — ||(-'^l7r(2);-''^7r(2)l)||2 > > 1 1 (-'^l7r(n) ) -'^7r{n)l ) 1 1 2 • 

Then, the random variable (in the space of infinite sequences in of non-increasing norm) 

n"-^/"((Xi^(i),X^(i)i), (Xi^(2),X^(2)i), • • • , (Xi^(„),X^(„)i), (0,0), . . .) 
converges weakly, for the finite dimensional convergence, to 

ii^iwiyi'''"' , (1 - £1)^1^]'^''°), {£2W2y2^^°', (1 - e2)w^2y2 ^''°)' • • •) • (6-7) 

In particular, we may define a bijection (T„ in N-'^, such that o"„(0) = 1, (Tn{k) = vr(A;) if 
k 7^ 7r^^(l), and (T„ arbitrary otherwise. Then, for this sequence cj„, we may check that 
7^-i/"(7-iX(T„e0 converges weakly to Ae0 in £'^(N^). 

This is not good enough since we aim at the convergence for all (p £ -^'(N''^), not only 
60. In particular, the above argument does not explain the presence of a tree in the limit 
operator. Note however that from what precedes, only the entries such that \Xij\ > 6n^^°' 
will matter for the operator convergence (for some small 6 > 0). By assumption, 

n\Xij\ > 5n^l^) = -, 
n 

where c = c(n) ~ 5^^/". In other words, if we define G as the oriented graph on {1, . . . , n} 
such that the oriented edge {i,j) is present if \Xij\ > 5n^^" then G is an oriented Erdos- 
Renyi graph (each oriented edge is present independently of the other and with equal 
probability). An elementary computation shows that the expected number of oriented 
cycles in G containing 1 and of length k is equivalent to /n. This implies that there 
is no short cycles in G around a typical vertex. At a heuristic level, this locally tree-like 
structure of random graphs explains the presence of the infinite tree T in the limit. 

We are not going to give the full proof of theorem 16.71 For details, we refer to [25} 
The strategy is as follows. For integer m, define Jm = U^q{1, • • • , m}'^ C N-^ and consider 
the matrix obtained as the projection of the random operator A on J^. We prove that 
for all integer m, there exists an injection Tr^ from to {1, . . . ,n} such that iTmi0) = 1 
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and the projection of n~^/°^X on TTmiJm) converges weakly to The conckision of 

theorem 16.71 follows by extracting a sequence rrin — t- oo such that the latter holds. 

To construct such injection iTm, we explore the entries of X: we first consider the m 
largest entries of the vector in (C^)™, ((Xi2,X2i), . . . , whose indices are de- 

noted by ii,...,im- We then look at the m-largest entries of ((Xj^j, .^s^, 
whose indices are ii^i, . . . ,ii,m- We repeat this procedure iteratively until we have dis- 
covered \Jm\ indices, and we define the injection tt^ as TTmiv) = iv The fact that the 
restriction of n~^/"X to {iv)veJ,n converges weakly to yl|m can be proved by developing 
the ideas presented above. □ 

Continuity of quaternionic resolvent for local convergence. Note that theorem l6.7l will have 
a potential interest for us, only if we know how to link the local convergence of definition 
16.61 to the convergence of the quaternionic resolvent introduced in sub-section 14.61 

Recall that an operator i? on a dense domain D{B) is Hermitian if for all x,y ^ D{B), 
{x,By) = {Bx,y). This operator will be essentially self-adjoint if there is a unique self- 
adjoint operator Bi on D{Bi) D D(B) such that for all x € D{B), Bix = Bx (i.e. Bi is 
an extension oi B). 

Lemma 6.8 (From local convergence to resolvents). Assume that {An) and A satisfy the 
conditions of definition \6.b\ and (A„,n) — )• {A^v) for some u,^ G N. If the bipartized 
operator B of A is essentially self-adjoint, then, for all q G 

RA„{q)uu RA{q) vv ■ 

Proof. Fix z e C and let Bn{z) = Bn — q{z,0) iS" /, where S„ is bipartized operator 
of An. By construction, for all cp £ D{B) = V{H x Z/2Z), an^ Bn{z)an4' converges to 
B{z)4> (this is the strong operator convergence). The proof is then a direct consequence 
of \121\ Theorem VIII. 25(a)]: in this framework, the strong operator convergence implies 
the strong resolvent convergence. Namely, for all (j),^ £ D{B) and r] £ C+, 

(0, {an'Bn{z)an " 7?/)"^) ^ {4>, {B{z) - 7?/)-V>. 

We conclude by applying this to </>, ^/^ G {e^, ea]. □ 

Remark 6.9 (A non-self-adjoint Hermitian operator). A key assumption in the above 
lemma is the essential self-adjointness of the bipartized limit operator. A local limit of 
Hermitian matrices will necessary be Hermitian. It may not however be always the case 
that the limit is essentially self-adjoint. Since any bounded Hermitian operator is essen- 
tially self-adjoint, for an example, we should look for an unbounded operator. let (afc)fcgN 
be a sequence on and define on -D(N), for k >2, 

Bck = OkCk+i + ak-iCk-i- 

while Bei = 0162. In matrix form, B is a tridiagonal symmetric infinite matrix. The 
work of Stieltjes |141) implies that B will be essentially self-adjoint if and only if 

lim a7^ = 00. 

n-s>oo ^-^ 
k>n 

6.4. Skeleton of the main proofs. All ingredients have finally been gathered. The 
skeleton of proof of theorems 16. 2|, 16.31 and the characterization of and Z^Q, 2 IS clS follows: 

(1) By lemma 14.211 for all q G IHI+, a.s. , in norm, 

rn-i/-x(9) -iEr„-i/-x(9) ^ 

(2) Since X has exchangeable rows, for all q G BI_|_, 

lEr„-i/«x(9) = Ei?„-i/c.x(g)ii 
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(3) We prove in sub-section 16 . 51 that the bipartized operator B of the random operator 
A of sub-section 16.31 is a.s. essentially self-adjoint 

(4) It follows by theorem 16.71 and lemma [67 



lim KT^-i/cxiq) = KRAiq),^ 



(a{q) h{q)\ 
\h{q) a{q)) 



(5) By lemma H. 191 a.s. v^-x/ax-z ^ ^oi,z as n — )■ oo, where Ua,z is characterized by 

mu^.A'n) = a{q) 

(6) We know from sub-section 16.21 that statement {ii) of lemma 14. 2UI holds for n~^/"X 
in probability. Then, in probability, /U„-i/Q;jf ~^ /Xq, as n — )■ oo, where /Xq, is 
characterized by, in 2?'(C), 

1 

Ha = lim dh{q) 

Vr q{z,it):t],0 

(7) We analyze in sub-section 16.51 4 iq)r^r^ to obtain the properties of Va,z and /io-. 

(8) Finally, when X12 has a bounded density we improve the convergence to almost 
sure (in sub-section 16. 6p . 



6.5. Analysis of the limit operator. This sub-section is devoted to items [3] and [7] which 
appear above in the skeleton of proof of theorems 16.21 and 16.31 



Self-adjointness. Here we check the self-adjointness of the bipartized operator B of A. 

Proposition 6.10 (Self-adjointness of bipartized operator on PWIT). Let A be the ran- 
dom operator defined by (|6.5p . With probability one, B is essentially self-adjoint. 



This proposition relies on the following criterion of self-adjointness (see [26] for a proof). 

Lemma 6.11 (Criterion of self-adjointness of the bipartized operator). Let k > and 

T = {V,E) be an infinite tree on and {wuv,Wvu){v,u}£E be a collection of pairs of 
weight in C such that for all u £ V , 

\wuv\'^ + \wyu\'^ < 00. 

v:{u,v}£E 

Define the operator on T>{V) as 

ACu — ^ ^ U^vu^^v 
v:{u,v}£E 

Assume also that there exists a sequence of connected finite subsets {Sn)n>i in- ^ , such 
that Sn C Sn+i, UnS'n = V, and for every n and v G Sn, 



E 

:{u,v}eE 



I ^uv I H" I ^vu I ) < 1^- 



Then the bipartized operator B of A is essentially self-adjoint. 
We will use a simple lemma on Poisson processes (for a proof |25[ Lemma A.4]). 

Lemma 6.12 (Poisson process tail). Let k>0, 0<a<2 and let < xi < X2 < • • • be 

a Poisson process of intensity 1 on // we define 

T:=inf|tGN: x^'/"<kI 

I k=t+l ) 

then Er is finite and goes to as k goes to infinity. 
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Proof of proposition \6.1(K For k > and v G N-'^ , we define 

oo 

r^ = inf{t>0: ^ \yvk\~^''" < . 
k=t+l 

The variables (r^,) are i.i.d. and by lemma [6. 121 there exists k > such that Et„ < 1. We 
fix such K. Now, we put a green color to all vertices v such that t„ > 1 and a red color 
otherwise. We consider an exploration procedure starting from the root which stops at 
red vertices and goes on at green vertices. More formally, define the sub- forest of T 
where we put an edge between v and vk if u is a green vertex and 1 < k < t^. Then, if 
the root is red, we set = C^{T) = {0}. Otherwise, the root is green, and we consider 
Tg = (V^^,£'|) the subtree of that contains the root. It is a Galton- Watson tree with 
offspring distribution T0. Thanks to our choice of k, is almost surely finite. Consider 
the leaves of this tree (i.e. the set of vertices v in V0 such that for all 1 < A; < r^,, v/c 
is red) . The following set satisfies the condition of Lemma 16.111 

•Si = \J{l<k<T,: vk}. 

We define the outer boundary of {0} as 

a,{0} = {l,...,r4 
and for u = (ii, . . . , i^) G N''^\{0} we set 

c>t{v} = {(n, . . . , ik-i,ik + 1)} U {(ii, . . . , Zfc, 1), • • • , (n, • • • , ik,Tv)}. 
For a connected set 5, its outer boundary is 

drS= [[jdr{v]\\S. 

Now, for each vertex ui,...,Uk E drSi, we repeat the above procedure to the rooted 
subtrees T^^ , . . . , Tu,^ ■ We set 

'S'2 = 'S'l 1^ Ui<j<fcC''(T„.). 

Iteratively, we may thus almost surely define an increasing connected sequence (Sn) of 
vertices with the properties required for lemma [6.111 □ 

Computation of resolvent. As explained in section 16. 4| the properties of the measures fia 
and i^a^z can be deduced from the analysis of the limit resolvent operator. Resolvent are 
notoriously easy to compute on trees. More precisely, let T = {V, E) be a tree and A, B 
be as in lemma 16.111 and let G 1/ be a distinguished vertex of V (in graph language, we 
root the tree T at 0). For each v G y\{0}, we define Vy dV as the set of vertices whose 
unique path to the root contains v. We define T„ = {Vy.,Ey) as the subtree of T spanned 
by Vy. We may consider A^, the projection oi A onVy, and By the bipartized operator of 
Ay. Finally, we note that if B is self-adjoint then so is By{z) for every 2; G C. The next 
lemma is an operator analog of the Schur inversion by block formula (|4.12p . 

Lemma 6.13 (Resolvent on a tree). Let A,B be as in lemma V6.11\ If B is self-adjoint 
then for any q = q{z, r/) G IHI+, 

where RA{q)vv '■= ^vRb^{<i)^*v, o.'ri'd Rs^iq) = {By[z) — rj)"^ is the resolvent of By. 
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Ra{q)00 



We come back to our random operator A defined on the PWIT and its quaternionic 
resolvent Ra{(1)- We analyze the random variable 

'a{z,r]) h{z,'q)\ 
b'{z,r]) c{z,r])J' 

The random variables a{z, rj) solves a nice recursive distribution equation (RDE). This type 
of recursion equation is typical of combinatorial observable defined on random rooted trees. 
More precisely, we define the measure on M^., 

Lemma 6.14 (Recursive distribution equation). For all q = q{z,r]) £ IHI+, if Lq is the 
distribution on C-(- of a{z, rf) then Lg solves the equation in distribution: 



z 



2 



V + Ea,.>i Ckak ){v + Efc>i Cka'k 



(6.8) 



where a, {ak)keN o-n-d {a'f^)keN ^'"e i.i.d. with law Lg independent of {^^k}keN! {?fc}fceN two 
independent Poisson point processes on with intensity A^. 
Moreover, with the same notation, 

, d -z 



UP 



(6.9) 



Proof This is sa simple consequence of lemma I6.13[ Indeed, for /c G N, we define 
as the subtree of T spanned by kN-l" . With the notation of lemma 16.131 for A; G N, 
RBf^il) = {Bk{z) — r])~^ is the resolvent operator of Bf^ and set 

RA{q)kk = U,RBMK=(ll ll)- 
Then, by lemma (6. 131 we get 



'Efe(i-efc)yfe 



U + 







-1 / v + J2k^kyk 




-1 



Ck 



with 

\ k>l 

Now the structure of the PWIT implies that 
(j) ak and Ck have common distribution Lg 
(jj) the variables (afc,Cfe)fcgN are i.i.d. 
Also the thinning property of Poisson point processes implies that 

(jjj) {^kVk ^''"IfceN and {(1 — £k)yk ^^"IfceN are independent Poisson point process with 
common intensity A^. 

□ 

Even if ()6.8p looks complicated at first sight, for r] = it, it is possible to solve it explicitly. 
First, for t £ M+, a{z,it) is pure imaginary and we set 

h{z,t) = Jm{a{z,it)) = —ia{z,it) G {0,t^^]. 
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The crucial ingredient, is a well-known and beautiful lemma. It can be derived form 
a representation of stable laws, see e.g. LePage, Woodroofe, and Zinn pTOlj and also 
Panchenko and Talagrand [117^ Lemma 2.1]. 

Lemma 6.15 (Poisson-stable magic formula). Let {(,k}keN be a Poisson process with 
intensity A^. If (Y^) is an i.i.d. sequence of non-negative random variables, independent 

o/{^fc}fceN; such that E[y^^] < oo then 

km feeN 
where S is the positive ^-stable random variable with Laplace transform for all x > 0, 

Eexp(-x5) = exp (^-r(^l - (6.10) 

Proof of lemma \6.15[ Recall the formulas, for y > 0, > and < r/ < 1 respectively, 

r(r?)-W x'^-^e-'^^dx and y'^ = T (1 - r])-\ x'^'-^l - e-''y)dx. (6.11) 
Jo Jo 

From the Levy-Khinchin formula we deduce that, with s > 0, 

Eexp (^-sJ2^kYj}j = exp (e"^'"'^! - l)^x''^~^dx 



a 



exp -r 1-- s2E[yi 



□ 



Hence, by lemma [6.15|, we may rewrite ()6.8p as 



, A t + yS 

^- \,\2 + ^t + yS){t + yS'y ^'-'"^ 

where S and S' are i.i.d. variables with common Laplace transform ()6.10p and the function 
y = y(|z|2,i) = E[W2]2/" is the unique solution of the equation in y: 



1 = E 



ty-^ + S 



\z\'^ + {t + yS)it + yS'), 

(since the left hand side is decreasing in y, the solution is unique). In the above equations, 
it is also possible to consider the limits as t J, 0. 

As explained in section \6M this implies that, in T>'{C), is equal to 

--limE6(-,it). 

vr 40 

Using ()6.9p . after a simple computation, we find that the density Qa of fia at z is 



.(y2(|^|2)_2|^|V(kn2/:(N|2))E- 



' {\z\^ + yKmss'y 

where y*(r) = y{r) is the unique solution 

a 

\r + y^bb' ^ 

After more computations, it is even possible to study the regularity of find the explicit 
solution at 0, and an asymptotic equivalent as r — )■ oo. All these results can then be 
translated into properties of We will not pursue here these computation which are 
done in [2^. We may simply point out that /i^ converges weakly to the circular law as 
a —7- 2, is a consequence of the fact that the non-negative a/2-stable random variable 
5'/r(l — a/2y/°^ converges to a Dirac mass as a — )• 2 (see (|6.1Up l. 
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6.6. Improvement to almost sure convergence. Let 1^0- 2 be as in theorem 16.31 In 
order to improve the convergence to a.s., it is sufficient to prove that for all z £ C, a.s. 

/•CO 

lim Uf, (z) = L where L := - log{s) dua,z{s). 

We have already proved that this convergence holds in probability. It is thus sufficient to 
prove that there exists a deterministic sequence L„ such that a.s. 



hm iU^ (z)-Lj =0. (6.13) 

Now, thanks to the bounded density assumption and remark 14.171 one may use lemma 
14.121 for the matrix X — n^^^zl in order to show that that there exists a number 6 > 
such that a.s. for n ^ 1, 

s„(n-^/°X - zl) > n-^ 
Similarly, up to an increase of b if needed, we also get from ()6.2p that a.s. for n ^ 1, 

si(n-^/"X-z/) <n^ 

Now, we consider the function 

fn{x) = 1{„-0<|^|<„0} log(x). 

From what precedes, a.s. for n ^ 1, 

Uf, _^/^{z) = - I \og{s)di^^-i/c.x-ziis) = - fn{s)dv^-i/c.x-zi{s)- (6-14) 

JO JO 

The total variation of /„ is bounded by clogn for some c > 0. Hence by lemma 14.181 if 

Ln:=E J fn{s)diy^-i/cx-ziis), 



then we have, 



>t] < 2 exp 



fn{s)du^-i/c.x-ziis) - L 
In particular, from the first Borel-Cantelli lemma, a.s., 

}^ (/ f"-^^^ du^-^'^x-zii^) - = 0- 
Finally, using (|6.14|) . we deduce that (|6.13|) holds almost surely. 



nt^ 



(clog n)" 



7. Open problems 
We list in this section some open problems related to the circular law theorem. 

Universality of Gaussian Ensembles. The universality dogma states that if a real or com- 
plex functional of X is enough symmetric and depend on enough entries then it is likely 
that this functional behaves asymptotically (n — )■ 00) like in the Gaussian case (Ginibre 
Ensemble here) as soon as the first moments of Xn match certain Gaussian moments 
(depends on the functional). This can be understood as a sort of non-linear central limit 
theorem. Among interesting functionals, we find for instance the following: 

• Spectral radius (has Gumbel fluctuations for the Complex Ginibre Ensemble); 

• argument of Ai(X) (is uniform on [0,27r] for the Complex Ginibre Ensemble); 

• Law of A„(n~^/^X) (see [49, Chapter 15] for the Complex Ginibre Ensemble). The 
square of the smallest singular value Sn(n~^/^G)^ of the Complex Ginibre Ensemble 
follows an exponential law [ID] and this result is asymptotically universal [149| : 

• Gap probabilities and Voronoi cells (see [3] and [52] for the Ginibre Ensemble); 

• Linear statistics of f^x (some results are available such as |124t \l'2b\ [271 1123| ) ; 
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• Empirical distribution of the real eigenvalues of n^'^^'^X when Xn is real (tends 
to uniform law on [— 1 , 1] for the Real Ginibre Ensemble) ; 

• Unitary matrix in the polar decomposition (Haar unitary for the Complex Ginibre). 
This is connected to the R-diagonal concept in free probability theory [75]; 

• If Xii has infinite fourth moment then the eigenvalues of largest modulus blow up 
and are asymptotically independent (Poisson statistics at some scale); 

• A large deviations principle for /zx at speed ii? which includes as a special case 
the one obtained for the Complex Ginibre Ensemble by Hiai and Petz |119j (see 
also Ben Arous and Zeitouni [17j) and references therein. The analogous question 
for Hermitian models (Wigner and CUE) is also open. The answer depends on the 
chosen scale, the class of deviations, and the topology. 

One may group most of these functionals by considering the spectrum as a point process. 
It is also possible to consider universality beyond i.i.d. entries models. For instance, 

2 

if X has exchangeable entries as a random vector of C" and if X satisfies to suitable 
mean-variance normalizations, then we expect that E/Lix tends to the circular law due to a 
Lindeberg type phenomenon, see [M] for the Hermitian case (Wigner). Similarly, if X, as 

2 

a random vector of , is log-concave (see footnote [22]) and isotropic (i.e. its covariance 
matrix is identity) then we expect that E//x tends to the circular law, see [T] for i.i.d. 
log-concave rows. Since the indicator of a convex set is a log-concave measure, one may 
think about the Birkhoff polytope (convex envelope of permutation matrices) and ask if 
the circular law holds for random uniform doubly stochastic matrices, see |32j and ^35] . 

Variance profile. We may consider the matrix Y defined as Yij = Xija{i/N, j /N) where 
a : [0, 1]^ — )• [0, 1] is a measurable function. The measure /i„-i/2y should converge a.s. to 
a limit probability measure /io- on C. For finite variance Hermitian matrices, this question 
has been settled by Khorunzhy, Khoruzhenko, Pastur and Shcherbina |95) . for heavy tailed 
Hermitian matrices, by Belinschi, Dembo, Guionnet |15] . Girko has also results on the 
singular values of random matrices with variance profile. 

Elliptic laws. We add some dependence in the array {Xij)ij>i : we consider an infinite 
array {Xij , Xji)i<i^j<n of i.i.d. pairs of complex random variables, independent of {Xii)i>i 
an i.i.d. sequence of random variables. Assume that Var(Xi2) = Var(X2i) = 1 and 
Cor(Xi2,X2i) = t £ {z £ C : \z\ <1}. There is a conjectured universal limit for ix^-\/2x 
computed by Girko |55] . called the elliptic law. This model interpolates between Hermitian 
and non- Hermitian random matrices. When X = y^(l + r) /2Hi + isj (1 — r) jlH^^ with 
< r < 1 and Hi,H2 two independent GUE, this model has been carefully analyzed by 
Bender in |19j , see also Ledoux [100| , Johansson |90] , and Khoruzhenko and Sommers [96] . 

Oriented r -regular graphs and Kesten-McKay measure. Random oriented graphs are host 
of many open problems. For example, for integers n > r > 3, an oriented r-regular graph 
is a graph on n vertices such that all vertices have r incoming and r outgoing oriented 
edges. Consider the adjacency matrix yl of a random oriented r-regular graph sampled 
from the uniform measure (there exists suitable simulation algorithms using matchings of 
half edges). It is conjectured that as n — )• oo, a.s. fiA converges to the probability measure 

1 r^(r — 1) 

-(^2 _ |^|2)2i{i^i<v?} ^^^y- 

It turns out that this probability measure is also the Brown measure of the free sum of r 
unitary, see Haagerup and Larsen [75| . The Hermitian (actually symmetric) version of this 
measure is known as the Kesten-McKay distribution for random non-oriented r-regular 
graphs, see [9 ^ llllj . We recover the circular law when r — )• oo up to renormalization. 
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Invertibility of random matrices. The invertibility of random matrices is one of the keys 
behind the circular law theorem 12.21 Let us consider the case were Xn is Bernoulli 
^{5-i+Si). A famous conjecture by Spielman and Teng (related to their work on smoothed 
analysis of algorithms [14^ I139| ) states that there exists a constant < c < 1 such that 



for n S> 1 and any small enough t >0. This was almost solved by Rudelson and Vershynin 
[128] and Tao and Vu In particular, taking t = gives P(s„(X) = 0) = c". This 

positive probability of being singular does not contradict the asymptotic invertibility since 
by the first Borel-Cantelli lemma, a.s. Sn(X) > for n ^ 1. Regarding the constant c 
above, it has been conjectured years ago that 



This intuition comes from the probability of equality of two rows, which implies that 
¥(sniX) = 0) > (1/2)". Many authors contributed to the analysis of this difficult nonlin- 
ear discrete problem, starting from Komlos, Kahn, and Szemeredi. The best result to date 
is due to Bourgain, Vu, and Wood [29j who proved that P(s„(X) = 0) < (l/\/2 + o(l))". 

Roots of random polynomials. The random matrix X has i.i.d. entries and its eigenvalues 
are the roots of its characteristic polynomial. The coefficients of this random polynomial 
are neither independent nor identically distributed. Beyond random matrices, let us con- 
sider a random polynomial P{z) = oq + aiz + • • • + a^z" where oq, . . . , are independent 
random variables. By analogy with random matrices, one may ask about the behavior as 
n ^ oo of the roots Ai(P), . . . , A„(P) of P in C and in particular the behavior of their 
empirical measure ^ X]r=i '^Ai(P)- The literature on this subject is quite rich and takes its 
roots in the works of Littlewood and Offord, Rice, and Kac. We refer to Shub and Smale 
|136j . Azai's and Wschebor [9], and Edelman and Kostlan [43\ [Hj for (partial) reviews. 
As for random matrices, the case where the coefficients are real is more subtle due to the 
presence of real roots. Regarding the complex case, the zeros of Gaussian analytic func- 
tions is the subject of a recent monograph [83] in connection with determinantal processes. 
Various cases are considered in the literature, including the following three families: 

• Kac polynomials, for which (aj)o<i<n are i.i.d. 

• Binomial polynomials, for which Oj = \J~^bi for all i and (^i)o<i<n i.i.d. 

• Weyl polynomials, for which Oj = -^^j for all i and (&j)o<i<n ^-^-d- 

Geometrically, the complex number z is a root of P if and only if the vectors (1, z, . . . , z"') 
and (oo, oT, . . . , a^) are orthogonal in C'^"'"^, and this connects the problem to Littlewood- 
Offord type problems |102j and small balls probabilities. Regarding Kac polynomials, 
Kac |92|, 191) has shown in the real Gaussian case that the asymptotic number of real 
roots is about ^ log(n) as n — )• oo. Kac obtained the same result when the coefficients 
are uniformly distributed [93] . Hammersley [77] derived an explicit formula for the k- 
point correlation of the roots of Kac polynomials. The real roots of Kac polynomials 
were extensively studied by Maslova [1091 HQS] , Ibragimov and Maslova [851 EZl [881 ES] , 
Logan and Shepp [1041 1105] . and by Shepp and Farahmand [132J. Shparo and Shur [135j 
have shown that the empirical measure of the roots of Kac polynomials with light tailed 
coefficients tends as n — )• cxd to the uniform law one the unit circle {zGC:|z| = 1} (the arc 
law). Further results were obtained by Shepp and Vanderbei {133j . Zeitouni and Zelditch 
|157| . Shiffman and Zelditch |134j . and by Bloom and Shiffman [2T]. If the coefficients 
are heavy tailed then the limiting law concentrates on the union of two centered circles, 
see [HB] and references therein. Regarding Weyl polynomials, various simulations and 
conjectures have been made [S21I1S|. For instance, if (6i)o<i<n are i.i.d. standard Gaussian, 



P(\/ns„(X) <t)<t + c' 



n 



P(, 
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it was conjectured that the asymptotic behavior of the roots of the Weyl polynomials is 
analogous to the Ginibre Ensemble. Namely, the empirical distribution of the roots tends 
as n — >• c« to the uniform law on the centered disc of the complex plane (circular law), 
and moreover, in the real Gaussian case, there are about -^fn real roots as n — t- oo and 
their empirical distribution tends as n — )• oo to a uniform law on an interval, as for the 
real Ginibre Ensemble, see Remark 13.81 The complex Gaussian case was considered by 
Leboeuf [98] and by Peres and Virag |118j . while the real roots of the real Gaussian case 
were studied by Schehr and Majumdar [131]. Beyond the Gaussian case, one may try 
to use the companion matriji[3 of P and the logarithmic potential approach. Numerical 
simulations reveal strange phenomena depending on the law of the coefficients but we 
ignore it they are purely numerical. Note that if the coefficients are all real positive then 
the roots cannot be real positive. The heavy tailed case is also of interest (rings?). 

Appendix A. Invertibility of random matrices with independent entries 

This appendix is devoted to the proof of a general statement (lemma lA.ll below) on 
the smallest singular value of random matrix models with independent entries. It follows 
form lemma lA. II below that \iX = {Xij)^^^ is a random matrix with i.i.d. entries such 
as X\\ is not constant and E(|Xii|'*) < oo for some arbitrarily small real number k > 0, 
then for any 7 > there exists are real number /? > such that for any n ^ 1 and any 
deterministic matrix M G A4„(C) with si(M) < rC , 

lim P(s„(X + M) < n-^) = 0. 

Both the assumptions and the conclusion are strictly weaker than the result of Tao and Vu. 
It is enough for the proof of the circular law in probability and its heavy tailed analogue. 

Lemma A.l (Smallest singular value of random matrices with independent entries). // 

(Xjj)i<jj<„ is a random matrix with independent and non-constant entries in C and if 
a > is a positive real number such that 



b := min ¥{\Xij\ < a) > and a := min Y&r[ Xijlnx- iKa}] > 0) 
then there exists c = c{a, b,a) > 1 such that for any M £ 7W„(C), n>c, s>l,0<t<l, 
p(^s„(X + M) < ; si{X + M)<s^ < c^y\og{cs) (^ts'^ + -^^ . 

The proof of lemma \A.1\ follows mainly from [1U'6\ I128j . These works have already been 
used in the proof of the circular law, notably in [UB]. As we shall see, the term l/^/n 
comes from the rate in the Berry-Esseen Theorem. Following |lU3j . it could probably be 
improved by using finer results on the Littlewood-Offord problem [148]. Note however, 
that lemma [A. II is sufficient for proving convergence in probability of spectral measures. 

We emphasize that there is not any moments assumption on the entries in lemma lA.ll 
However, (weak) moments assumptions may be used in order to obtain an upper bound 
on the quantity ¥{si{X + M) > s). Also, the variance (of the truncated variables) a may 
depend on n : this allows to deal with sparse matrix models (not considered here). 

For the proof of the circular law and its heavy tailed analogue, lemma lA. II can be used 
typically with t = l/{s'^^/n) and s = rf large enough such that with high probability 
si{X + M) < s. In contrast with the Tao and Vu result, lemma [A. II cannot provide a 
summable bound usable with the first Borel-Cantelli lemma due to the presence of l/i/n. 

Let us give the idea behind the proof of lemma lA.ll A geometric intuition says that 
the smallest singular value of a random matrix can be controlled by the minimum of the 



^"^The companion matrix Af of Q{z) := co + ciz + ■ ■ ■ + Cn-12" ^ + is tlie n x n matrix with null 
entries except Mi^i+i = 1 and M„^i — Ci-i for every i. The characteristic polynomial of M is Q. 
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distances of each row to the span of the remaining rows. The distance of a vector to a 
subspace can be controlled with the scalar product of the vector with a unit norm vector 
belonging to the orthocomplement of the subspace. Also, when the entries of the matrix 
are independent, this boils down by conditioning to the control of a small ball probability 
involving a linear combination of independent random variables. The coefficients in this 
combination are the components of the orthogonal vector. The asymptotic behavior of 
this small ball probability depends in turn on the structure of these coefficients. When 
the coefficients are well spread, we expect an asymptotic Gaussian behavior thanks to the 
central limit theorem, more precisely its quantitative weighted version called the Berry- 
Esseen theorem. We will follow this scheme while keeping the geometric picture in mind. 

The proof of lemma PV. II is divided into two parts which correspond to a subdivision of 
the unit sphere S"~^ of C". Namely, for some real positive parameters 5,p > that will 
be fixed later, we define the set of sparse vectors 

Sparse := {x E C" : card(supp(x)) < 5n} 

and we split the unit sphere S""^ into a set of compressible vectors and the complementary 
set of incompressible vectors as follows: 

Comp := {x G S"^"*^ : dist(x. Sparse) < p} and Incomp := \ Comp. 

We will use the variational formula, for A G A^„(C), 

Sn{A) = min ||j4x||2 = min ( min ||j4x||2, min ||j4x||2). (A.l) 

Compressible vectors. Our treatment of compressible vectors differs significantly from [103[ 
1128] (it gives however a weaker statement). We start with a variation around lemma [4. 131 

Lemma A. 2 (Distance of a random vector to a small subspace). There exist £,c,6o > 
such that for all n ^ 1, all 1 < i < n, any deterministic vector v £ C" and any subspace 
H of C" with 1 < dim{H) < don, we have, denoting C := {Xu, . . . , X„j) + v, 

F{dist{C,H) < ecr^n) < cexp{-ccr^n). 

Proof. First, from Hoeffding's deviation inequality, 

nb\ / nb^\ 

^(El{l^..l<a}< yl <exp(^-— J. 

It is thus sufficient to prove the result by conditioning on 

Em ■= {l-'^iil < a, . . . , \Xmi\ < a} with m := [n6/2] . 

Let Em[-] := IE[- \Em',J~m] denote the conditional expectation given Em and the filtration 
Tm generated by Xm+i,i, ■ ■ ■ , Xn^i- Let W be the subspace spanned by H, v, and the 
vectors u := (0, . . . , 0, Xm+i,i, ■■■ , Xn,i) and 

w:= {E[Xu I \Xu\<a],...,E[Xmi \ \Xmi\ < a],0, . . . ,0) . 

By construction dim(H^) < dim(f/') + 3 and W is J>„-measurable. We note also that 

dist(C, H) > dist(C, W) = dist(y, W), 

where 

Y:= {Xii-E[Xii I \Xii\ <a],...,Xmi-^Xm. \ \Xmi\ < a],0, . . . ,0) = C - u - v - w. 
By assumption, for 1 < k < m, 

EmYk = and Em\Yk\^ > a\ 
Let D = {z : \z\ < a}. We define the function / : — ^ M+ by 

fix) = dist((xi,...,Xm,0, ... ,0),PF). 
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This function is convex and 1-Lipschitz. Hence, Talagrand's concentration inequality gives 

Fmi\dist{Y,W) - Mm\ >t)< 4exp 
where is the median of / under P^- In particular, 



16a2 / ' 



Mm > yE™dist2(y, W) - ca. 
Also, if P denotes the orthogonal projection on the orthogonal of W^, we find 



kk 

k=l 



Vfc=l k=m+l / 

> a'^{n — d\m(H) — 3 — (n — m)) 



2 1 



> a^l^y -dim(F) -3 

The latter, for n large enough, is lower bounded by ca'^n if 6q = 6/4. □ 
Let < e < 1 and s > 1 be as in lemma lA. 21 We set from now on 

(in particular, p < 1/4). The parameter < 5 < 1 is still to be specified: at this stage, 
we simply assume that 5 < 6o. We note that if ^ G A^„(C) and y £ C" is such that 
supp(y) C vr C {1, . . . , n}, then we have 

\\Ay\\^ > ||y||2S„(^|^), 

where A^^ is the n x |7r| matrix formed by the columns of A selected by vr. We deduce that 

3 

min IIAxIL > - min Sn(A\^) — psi(A). (A. 2) 

rreComp" " 4 vrC{l,...,n}:|7r| = L-5nJ ^ FiV / \ J 

However, by Pythagoras theorem, for any x G C''^', 



I^k^|l2 



> max |xjpdist^(Cj, i^j) > min dist^(Cj, -ffj)-j — r 



I 

2 ^tzTT 



where Ci is the i-th column of A and Hi := spanjCj : J G vr, j 7^ i}. In particular, 



Sn{A\T,) > mindist(Ci,Fi)/v|7r|. 

Now, we apply this bound to A = X + M. Since Hi has dimension at most nS and is 
independent of Ci, by lemma [A. 21 the event that, 

mindist(Cj, i/j) > ea^fn, 
has probability at least 1 — cn(5 exp(— ca^n) for n ^ 1. Hence 

Therefore, using the union bound and our choice of p, we deduce from \K.2\ 
min \\{X^M)x\\^ < ; si(X + M) < < cf , ,V'^e"'"''' 
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with H{5) := —5log6 — {1 — 5) log(l — 5). Therefore, if 6 is chosen small enough so that 
H{6) < cc7^/2, we have proved that for some ci > 0, 



min ||(X + M)x||2 < 3^ ; si(X + M) < s ) < exp(-cin), (A.3) 



(note that the constant ci depends on a). From now on, we fix 

6 



logcj| ' 

with C2 small enough so that 5 < 6q and H{5) < ca'^/2. 

Incompressible vectors: invertibility via distance. We start our treatment of incompressible 
vectors with two key observations from [128| . 

Lemma A.3 (Incompressible vectors are spread). Let x G Incomp. There exists a subset 
TT C {1, . . . ,n} such that \tt\ > 6n/2 and for all i £ tt, 

^ < \Xi\ < 




In 

Proof. For vr C {1, . . . , n}, we denote by the orthogonal projection on span{e.j; i G vr}. 
Let TTi = {k : \xk\ < y^2/{6n)} and tt2 = {k : \xk\ > pj^fn). Since ||j;||2 = 1, we have 

bn 
~2' 

Note also that 

\\X Pty2x\\2 = ||^'''2"^ll2 — P' 

Hence, the definition of incompressible vectors implies that |7r2| > 6n. We put vr = 7rin7r2. 
From what precedes, 



I I \ I CI I CI ^ dn 6n 
\tt\ > n — Ittj^I — iTTgl > n [n — on) = —. 



□ 



Lemma A. 4 (Invertibility via average distance). Let A be a random matrix taking its 
values in 7W„(C), with columns Ci, . . . , C„, and for some arbitrary 1 < k < n, let be 
the span of all these columns except Ck- Then, for any t > 0, 

f( min IMxIU < < VP(dist(Cfc,i?fc) < 0- 

yxeincomp ^Jn J on ^-^ 

Proof. Let x G S"~^, from Ax = C^Xk, we get 

IIAxlU > max dist(Ax, Hh) = max \xk\dist(Ck, Hb). 

^ l<fc<n l<fc<n 

Now if X G Incomp and vr is as in lemma IA.3| we get 

||j4x||2 > -7= maxdist(Cfc, i/fc). 

^/n fcSTT 

Then, the conclusion follows from the fact that for any reals yi, . . . , y„ and 1 < m < n, 
l{maxi<fe<„j/fe<t} < ~''^^{yk<t} — ~ '^{yk<t}- 

k=l k=l 

n 

The strength of lemma lA.41 lies in the fact that the control of ||^2;||2 over all incom- 
pressible vectors is done by an average of the distance between the columns of A. 
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Incompressible vectors: small ball probability. Now, we come back to our matrix X + M: 
let C be the k-th column oi X + M and H be the span of all columns but C. Our goal in 
this sub-section is to establish the bound, for all t >0, 



P(dist(C, H)<pt; si{X + M)<s)< ^ ^ T + ^ ' ' ^"^'^^ 



To this end, we also consider a random vector C taking its values in S"^^ n H-^, which is 
independent of C. Such a random vector is not unique, we just pick one and we call it 
the orthogonal vector (to the subspace H). We have 

dist(C,i/) > KC,C)|. (A.5) 

Lemma A.5 (The random orthogonal vector is Incompressible). For our choice of p,6 
and c\ as in ()A.3p . we have 

e Comp ; si(X + M) < s) < exp(-cio-^n). 



Proof. Let A € 7V4n-i,n(C) be the matrix obtained from {X + M)* by removing the k-ih 
row. Then, by construction : AC = 0, si{{X + M)*) = si{X + M), and 

min ||^x||2 > min ||(X + M)*x||2. 

xSComp xSComp 

The left hand side (and thus the right hand side) is zero if C ^ Comp. In particular, 



P(C € Comp;si(X + M) < s) < P min \\{X + MYxW^ = ; si{{X + M)*) < s 

y a;GComp 

It remains to notice that obviously (|A.3p holds with {X + M) replaced by {X + M)* . 
Indeed the statistical assumptions are the same on X + M and (X + M)* . □ 

We have reached now the final preparation step before the use of the Berry-Esseen 
theorem. This step consists in the reduction to a case where for a fixed set of coordinates, 
both the components of C and the random variables Xi^ + Mik are well controlled. Namely, 
if C G Incomp, let vr C {l,...,n} be as in lemma [A. 31 associated to vector Then 
conditioned on {(, £ Incomp}, from Hoeffding's deviation inequality, the event that 

Inlb 6bn 



^'^{\X,k\<a} > — > 

k2 . 



has conditional probability at least 1 — exp(— |7r|6 /2) > 1 — exp(— C(5n) (recall that Q hence 
vr are independent of C). In summary, using our choice of J, /?, by lemma lA.51 and (|A.5p . 
in order to prove (|A.4|) . it is sufficient to prove that for all t > 0, 



P™(l(C.C)l<P*)<^V^(t + ;ij 



where Pm(0 = '^{■\Em-, ^m) is the conditional probability given Fm the cr-algebra generated 
by all variables but (Xi^, . . . , X^fc), fn = [(5&ri/4j , and the event 



{^<IC.I</J;i<^<-}U{I^^' 



E^={^< \Ci\ < \/— ;1 ^ ^ ^ rn}[}{\Xik\ <a;l<i<m}. 
We may write 



■ u 

1=1 i=l 

where u G J^m is independent of {Xi^, ■ ■ ■ , AT^fc). It follows that 



m,c)\<pt)< sup 

zeC,7rC{l,.--,m} 



<pt\. (A.6) 
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The idea, originated from [103] , is now to use the rate of convergence given by the Berry- 
Esseen theorem to upper bound this last expression. 

Lemma A. 6 (Smah ball probability via Berry-Esseen theorem). There exists c > such 
that if Zi, . . . , Zn are independent centered complex random variables, then for all t > 0, 



supl 

zee 



i=l 



<t\< 



Ct 



+ 



2^^3/2- 



Proof Let = EiLi^l^iP, then either Er=iE(^cZi)2 or E"=iIEpmZ,)2 is larger or 
equal to Also 



i=l 



<t \ < 



i=l 



< t 



and similarly with Dim. Hence, up to loosing a factor 2, we can assume with loss of 
generality that the Zj's are real random variables. Then, if G is a real centered Gaussian 
random variable with variance r^, Berry-Esseen theorem asserts that 



sup 



^Zi<t \ -F{G<t] 



■,i=l 



<coT-3/2^E(|Z, 



i=l 



In particular, for all t > and x G M, 



i=l 



<t \ < F{\G-x\ < t) + 2cor-^/^^E(|Zi 



2 = 1 



We conclude by using the fact that G has a density upper bounded by l/V2TrT^. □ 
Define L = ^ log2 Note that for our choice of p, 5, for some constant c = c(a, 6), 

L < c|log p\. 

For 1 < J < I/, we define 

1 <i<m: — — < \Ci\ < — 
n \/n 



22j-2pV2|7r,-| 



From the pigeonhole principle, there exists j such that \7rj\ > m/L. We have 
and, 



n 



We deduce by ()A.6p and lemma [A. 61 that (by changing the value of c), for all t > 0, 



< c /^ogpl 

- C7 V 5 



The proof of (|A.4p is complete. 
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Proof of lemma lA.li All ingredient have now been gathered. By lemma [A. 41 and (|A.4p we 
find, for all t > 0, 

min ||(X + m)x||2 < ^ ; si{X + M) < sV^) < - 

xelncomp J CJ V O'^ \ 



Jn 

Using our choice of p, 5, we obtain for some new constant c = c(a, 6, a) > 0, 

min ||(X + m)x||2 < ; si{X + M) < s ) < c\/log(cs) ( ts^ + ^ 

as-elncomp -y/n / \ -y/n 

The desired result follows then by using (|A.ip and (|A.3p . □ 
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